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48000 AND 52920, NOVEL HUMAN CALCIUM CHANNELS AND USES THEREOF 



Related Applications 

This application claims priority to U.S. Provisional Patent Application No. 
5 60/214,176 filed on June 26, 2000, incorporated herein in its entirety by reference. 



Background of the Invention 

Calcium signaling has been implicated in the regulation of a variety of cellular 
responses, such as growth and differentiation. There are two general methods by which 

10 intracellular concentrations of calcium ions may be increased: calcium ions may be 

freed from intracellular stores, transported by specific membrane channels in the storage 
organelle, or calcium ions may be brought into the cell from the extracellular milieu 
through the use of specific channels in the cellular membrane. In the situation in which 
the intracellular stores of calcium have been depleted, a specific type of calcium 

15 channel, termed a 'capacitative calcium channel' or a 'store-operated calcium channel' 
(SOC), is activated in the plasma membrane to import calcium ions from the 
extracellular environment to the cytosol (see Putney and McKay (1 999) BioEssays 
21 :38-46). Calcium may also enter the cell via receptor-stimulated cation channels (see 
Hofmann et aL (2000) J. Mol Med. 78:14-25). 

20 There is no single electrophysological profile characteristic of the calcium 

channel family; rather, a wide array of single channel conductances, cation selectivity, 
and current properties have been observed for different channels. Further, in several 
instances it has been demonstrated that homo- or hetero-polymerization of the channel 
molecule may occur, further changing the channel properties from those of the single 

25 molecule. In general, though, these channels function similarly, in that they are calcium 
ion-permeable cation channels which become activated after agonist binding to a G 
protein-coupled receptor. 

Members of the capacitative calcium channel family include the calcium release- 
activated calcium current (CRAC) (Hoth and Penner (1992) Nature 355: 353-355), 

30 calcium release-activated non- selective cation current (CRANC) (Krause et ah (1996) J, 
Biol Chem. 271 : 32523-32528), and the transient receptor potential (TRP) proteins 
TRP1, TRP2, TRP4, and TRP5. Depletion of intracellular calcium stores activate these 
channels by a mechanism which is yet undefined, but which has been demonstrated to 
involve a diffusible factor using studies in which calcium stores were artificially 

35 depleted (e.g., by the introduction of chelators into the cell, by activating phospholipase 
C y , or by inhibiting those enzymes responsible for pumping calcium ions into the stores 
or those enzymes responsible for maintaining resting intracellular calcium ion 
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concentrations) (Putney, J.W., (1986) Cell Calcium 7:1-12; Putney, J.W. (1990) Cell 
Calcium 11:611-624). 

Recently, it has been elucidated that three TRP family members, TRP3, TRP6, 
and a mouse homologue, TRP7, form a sub-family of receptors that are activated in a 
5 calcium store-depletion independent manner. TRP3 and TRP6 are activated by 
diacylglycerols in a membrane delimited manner (Hofinann et al (1999) Nature 
397:259-263). Similarly, murine TRP7 is activated via diacylglycerol stimulation by G q 
protein coupled receptors (Okada et al (1999) J. Biol Chem. 274:27359-27370). 

The TRP channel family is one of the best characterized calcium channel protein 

10 families. These channels include transient receptor potential protein and homologues 
thereof (to date, seven TRP homologues and splice variants have been identified in a 
variety of organisms), the vanilloid receptor subtype I (also known as the capsaicin 
receptor); the stretch-inhibitable non-selective cation channel (SIC); the olfactory, 
mechanosensitive channel; the insulin-like growth factor I-regulated calcium channel; 

15 the vitamin D-responsive apical, epithelial calcium channel (ECaC); melastatin; and the 
polycystic kidney disease protein family (see, e.g., Montell and Rubin (1989) Neuron 
2:1313-1323; Caterina et al (1997) Nature 389: 816-824; Suzuki et al (1999; J. Biol 
Chem. 274: 6330-6335; Kiselyov et al (1998) Nature 396: 478-482; Hoenderop et al 
(1999)./. Biol Chem. 274: 8375-8378; and Chen et al (1999) Nature 401(6751): 383-6). 

20 Each of these molecules is 700 or more amino acids in length, and shares certain 
conserved structural features. Predominant among these structural features are six 
transmembrane domains, with an additional hydrophobic loop present between the fifth 
and sixth transmembrane domains. It is believed that this loop is integral to the activity 
of the pore of the channel formed upon membrane insertion (Hardie and Minke (1993) 

25 Trends Neurosci 16: 371-376). TRP channel proteins also include one or more ankyrin 
domains. Although found in disparate tissues and organisms, members of the TRP 
channel protein family all serve to transduce signals by means of calcium entry into 
cells, particularly pain signals (see, e.g., McClesky and Gold (1999) Annu. Rev. Physiol 
61 : 835-856), light signals (Hardie and Minke, supra), or olfactory signals (Colbert et al. 

30 (1997) J. Neurosci 17(21): 8259-8269). Thus, this family of molecules may play 
important roles in sensory signal transduction in general. 

Vanilloid receptors (VRs) are cation channels that are structurally related to 
members of the TRP family of ion channels. VRs share several physical characteristics 
including an N-terminal cytoplasmic domain which contains three ankyrin repeats, six 

35 transmembrane domains, a pore-loop region located between transmembrane domains 5 
and 6, and several kinase consensus sequences. These receptors have been proposed to 
mediate the entry of extracellular calcium into cells in response to the depletion of 
intracellular calcium stores. VRs are expressed in nociceptive neurons, as well as other 
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cells types, and are activated by a variety of stimuli including noxious heat and protons. 
A well-known agonist of VR1 is capsaicin, which induces pain behavior in humans and 
rodents. VR1 knockout mice have been shown to be impaired in their detection of 
painful heat, to exhibit no vanilloid-evoked pain behavior, and to show little thermal 
5 hypersensitivity after inflammation (Szallasi and Blumberg (1999) Pharmacol. Rev. 
51:159-211). 

Summary of the Invention 

The present invention is based, at least in part, on the discovery of novel calcium 

10 channel family members, referred to herein as "TRP-like calcium channel-4" and "TRP- 
like calcium channel-5" or "TLCC-4" and "TLCC-5" nucleic acid and polypeptide 
molecules. The TLCC-4 and TLCC-5 nucleic acid and polypeptide molecules of the 
present invention are useful as modulating agents in regulating a variety of cellular 
processes, e.g., membrane excitability, neurite outgrowth and synaptogenesis, signal 

15 transduction, cell proliferation, growth, differentiation, and migration, and nociception. 
Accordingly, in one aspect, this invention provides isolated nucleic acid molecules 
encoding TLCC-4 and TLCC-5 polypeptides or biologically active portions thereof, as 
well as nucleic acid fragments suitable as primers or hybridization probes for the 
detection of TLCC-4-encoding and TLCC-5-encoding nucleic acids. 

20 In one embodiment, the invention features an isolated nucleic acid molecule that 

includes the nucleotide sequence set forth in SEQ ID NO:l, 3, 4, or 6. In another 
embodiment, the invention features an isolated nucleic acid molecule that encodes a 
polypeptide including the amino acid sequence set forth in SEQ ID NO:2 or 5. In 
another embodiment, the invention features isolated nucleic acid molecules that include 

25 the nucleotide sequences contained in the plasmids deposited with ATCC® as 

Accession Numbers . 

In still other embodiments, the invention features isolated nucleic acid molecules 
including nucleotide sequences that are substantially identical (e.g., 60%, 65%, 70%, 
75%, 80%, 85%, 90%, 95% or 98% identical) to the nucleotide sequence set forth as 

30 SEQ ID NO:l, 3, 4, or 6. The invention further features isolated nucleic acid molecules 
including at least 50 contiguous nucleotides of the nucleotide sequence set forth as SEQ 
ID NO:l, 3, 4, or 6. In another embodiment, the invention features isolated nucleic acid 
molecules which encode a polypeptide including an amino acid sequence that is 
substantially identical (e.g., 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95% or 98% 

35 identical) to the amino acid sequence set forth as SEQ ID NO:2 or 5. The present 
invention also features nucleic acid molecules which encode allelic variants of the 
polypeptide having the amino acid sequence set forth as SEQ ID NO: 2 or 5. In addition 
to isolated nucleic acid molecules encoding full-length polypeptides, the present 
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invention also features nucleic acid molecules which encode fragments, for example 
biologically active or antigenic fragments, of the full-length polypeptides of the present 
invention (e.g., fragments including at least 10, 15, 20, 25, 30, 35, 40, 45 or 50 
contiguous amino acid residues of the amino acid sequence of SEQ ID NO:2 or 5). In 
5 still other embodiments, the invention features nucleic acid molecules that are 

complementary to, antisense to, or hybridize under stringent conditions to the isolated 
nucleic acid molecules described herein. 

In another aspect, the invention provides vectors including the isolated nucleic 
acid molecules described herein (e.g., TLCC-4-encoding or TLCC-5-encoding nucleic 

10 acid molecules). Such vectors can optionally include nucleotide sequences encoding 
heterologous polypeptides. Also featured are host cells including such vectors (e.g., 
host cells including vectors suitable for producing TLCC-4 or TLCC-5 nucleic acid 
molecules and polypeptides). 

In another aspect, the invention features isolated TLCC-4 or TLCC-5 

15 polypeptides and/or biologically active or antigenic fragments thereof. Exemplary 

embodiments feature a polypeptide including the amino acid sequence set forth as SEQ 
ID NO:2 or 5, a polypeptide including an amino acid sequence at least 60%, 65%, 70%, 
75%, 80%, 85%, 90%, 95% or 98% identical to the amino acid sequence set forth as 
SEQ ID NO:2 or 5, a polypeptide encoded by a nucleic acid molecule including a 

20 nucleotide sequence at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95% or 98% 

identical to the nucleotide sequence set forth as SEQ ID NO:l, 3, 4, or 6. Also featured 
are fragments of the full-length polypeptides described herein (e.g., fragments including 
at least 10 contiguous amino acid residues of the sequence set forth as SEQ ID NO: 2 or 
5) as well as allelic variants of the polypeptide having the amino acid sequence set forth 

25 asSEQIDNO:2or5. 

The TLCC-4 or TLCC-5 polypeptides and/or biologically active or antigenic 
fragments thereof, are useful, for example, as reagents or targets in assays applicable to 
treatment and/or diagnosis of TLCC-4 or TLCC-5 mediated or related disorders. In one 
embodiment, a TLCC-4 or TLCC-5 polypeptide, or fragment thereof, has a TLCC-4 or 

30 TLCC-5 activity. In another embodiment, a TLCC-4 or TLCC-5 polypeptide, or 

fragment thereof, has one or more of the following domains: an ankyrin repeat domain, 
a transmembrane domain, a pore domain, a transient receptor domain, a kinase 
consensus sequence, and/or an ion transport protein domain, and optionally, has a 
TLCC-4 or TLCC-5 activity. In a related aspect, the invention features antibodies (e.g., 

35 antibodies which specifically bind to any one of the polypeptides described herein) as 
well as fusion polypeptides including all or a fragment of a polypeptide described 
herein. 
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The present invention further features methods for detecting TLCC-4 or TLCC-5 
polypeptides and/or TLCC-4 or TLCC-5 nucleic acid molecules, such methods 
featuring, for example, a probe, primer or antibody described herein. Also featured are 
kits, e.g., kits for the detection of TLCC-4 or TLCC-5 polypeptides and/or TLCC-4 or 
5 TLCC-5 nucleic acid molecules. In a related aspect, the invention features methods for 
identifying compounds which bind to and/or modulate the activity of a TLCC-4 or 
TLCC-5 polypeptide or TLCC-4 or TLCC-5 nucleic acid molecule described herein. 
Further featured are methods for modulating a TLCC-4 or TLCC-5 activity. 

10 Other features and advantages of the invention will be apparent from the 

following detailed description and claims. 

Brief Description of the Drawings 

Figure J depicts the cDNA sequence and predicted amino acid sequence of 
15 human TLCC-4. The nucleotide sequence corresponds to nucleic acids 1 to 4586 of 
SEQ ID NO: 1 . The amino acid sequence corresponds to amino acids 1 to 742 of SEQ 
ID NO:2. The coding region without the 5' and 3' untranslated regions of the human 
TLCC-4 gene is shown in SEQ ID NO:3. 

Figure 2 depicts the cDNA sequence and predicted amino acid sequence of 
20 human TLCC-5. The nucleotide sequence corresponds to nucleic acids 1 to 3042 of 
SEQ ID NO:4. The amino acid sequence corresponds to amino acids 1 to 1013 of SEQ 
ID NO:5. The coding region of the human TLCC-5 gene is shown in SEQ ID NO:6. 

Figure 3 depicts TLCC-4 mRNA expression in normal human tissues, as 
determined using RT-PCR. 
25 Figure 4 depicts a structural, hydrophobicity, and antigenicity analysis of the 

human TLCC-4 polypeptide (SEQ ID NO:2). 

Figure 5 depicts the results of a search which was performed against the HMM 
database in PFAM and which resulted in the identification of three "ankyrin repeat 
domains" and an "ion transport protein domain" in the human TLCC-4 polypeptide 
30 (SEQ ID NO:2). 

Figure 6 depicts the results of a search which was performed against the 
MEMSAT database and which resulted in the identification of six "transmembrane 
domains" in the human TLCC-4 polypeptide (SEQ ID NO:2). 

Figure 7 depicts a structural, hydrophobicity, and antigenicity analysis of the 
35 human TLCC-5 polypeptide (SEQ ID NO:5). 

Figure 8 depicts the results of a search which was performed against the HMM 
database which resulted in the identification of two "transient receptor domains" in the 
amino acid sequence of the human TLCC-5 polypeptide (SEQ ID NO: 5). 

5 
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Figure 9 depicts the results of a search which was performed against the 
MEMS AT database and which resulted in the identification of two "transmembrane 
domains" in the human TLCC-5 polypeptide (SEQ ID NO:5). 

Figure 10 depicts an alignment of the human TLCC-5 amino acid sequence 
5 (SEQ ID NO: 5) with the amino acid sequence of human transient receptor 7 (hTRP7), 
using the CLUSTAL W (1.74) alignment program. 

Figure 11 depicts an alignment of the human TLCC-5 amino acid sequence 
(SEQ ID NO: 5) with the amino acid sequence of human melastatin, using the 
CLUSTAL W (1.74) alignment program. 
10 Figure 12 is a graph depicting the expression of human TLCC-4 cDNA (SEQ ID 

NO:3) in various human tissues as determined by Taqman analysis. 

Figure 73 is a graph depicting the expression of human TLCC-4 cDNA (SEQ ID 
NO:3) in various diseased and normal state human tissues as determined by Taqman 
analysis. 

15 Figure 14 is a graph depicting the expression of human TLCC-4 cDNA (SEQ ID 

NO:3) in normal monkey and human tissues as determined by Taqman analysis. 

Figure 15 is a graph depicting the expression of human TLCC-4 cDNA (SEQ ID 
NO:3) in normal monkey and human tissues as determined by Taqman analysis. 

Figure 16 is a graph depicting the expression of human TLCC-4 cDNA (SEQ ID 
20 NO:3) in various tissues involved in metabolism as determined by Taqman analysis. 

Detailed Description of the Invention 

The present invention is based, at least in part, on the discovery of novel 
molecules, referred to herein as "TRP-like calcium channel-4" and "TRP-like calcium 

25 channel-5" or "TLCC-4" and "TLCC-5" nucleic acid and polypeptide molecules, which 
are novel members of the ion channel, e.g., calcium channel and/or vanilloid receptor, 
family. These novel molecules are capable of, for example, modulating an ion-channel 
mediated activity (e.g., a calcium channel- and/or vanilloid receptor-mediated activity) 
in a cell, e.g., a neuronal, skin, muscle (e.g., cardiac muscle), or liver cell. 

30 As used herein, an "ion channel" includes a protein or polypeptide which is 

involved in receiving, conducting, and transmitting signals in an electrically excitable 
cell, e.g., a neuronal or muscle cell. Ion channels include calcium channels, potassium 
channels, and sodium channels. As used herein, a "calcium channel" includes a protein 
or polypeptide which is involved in receiving, conducting, and transmitting calcium ion- 

35 based signals in an electrically excitable cell. Calcium channels are calcium ion 
selective, and can determine membrane excitability (the ability of, for example, a ■ 
neuronal cell to respond to a stimulus and to convert it into a sensory impulse). Calcium 
channels can also influence the resting potential of membranes, wave forms and 
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frequencies of action potentials, and thresholds of excitation. Calcium channels are 
typically expressed in electrically excitable cells, e.g., neuronal cells, and may form 
heteromultimeric structures (e.g., composed of more than one type of subunit). Calcium 
channels may also be found in non-excitable cells (e.g., adipose cells or liver cells), 
5 where they may play a role in, e.g., signal transduction. Examples of calcium channels 
include the low-voltage-gated channels and the high-voltage-gated channels. Calcium 
channels are described in, for example, Davila et al. (1999) Annals New York Academy 
ofSciences .868:102-17 and McEnery, M.W. et al (1998)J. Bioenergetics and 
Biomembranes 30(4): 409-418, the contents of which are incorporated herein by 

10 reference. As the TLCC-4 and TLCC-5 molecules of the present invention are calcium 
channels modulating ion channel mediated activities (e.g., calcium channel- and/or 
vanilloid receptor-mediated activities), they may be useful for developing novel 
diagnostic and therapeutic agents for ion channel associated disorders (e.g., calcium 
channel and/or vanilloid receptor associated disorders). 

15 As used herein, an "ion channel associated disorder" includes a disorder, disease 

or condition which is characterized by a misregulation of an ion channel mediated 
activity. For example, a "calcium channel associated disorder" includes a disorder, 
disease or condition which is characterized by a misregulation of a calcium channel 
mediated activity. Ion channel associated disorders, e.g., calcium channel associated 

20 disorders, include CNS disorders, such as cognitive and neurodegenerative disorders, 
examples of which include, but are not limited to, Alzheimer's disease, dementias 
related to Alzheimer's disease (such as Pick's disease), Parkinson's and other Lewy 
diffuse body diseases, senile dementia, Huntington's disease, Gilles de la Tourette's 
syndrome, multiple sclerosis, amyotrophic lateral sclerosis, progressive supranuclear 

25 palsy, epilepsy, Creutzfeldt- Jakob disease, or AIDS related dementia; autonomic 
function disorders such as hypertension and sleep disorders, and neuropsychiatry 
disorders, such as depression , schizophrenia, schizoaffective disorder, korsakoff s 
psychosis, mania, anxiety disorders, or phobic disorders; leaning or memory disorders, 
e.g., amnesia or age-related memory loss, attention deficit disorder, psychoactive 

30 substance use disorders, anxiety, phobias, panic disorder, as well as bipolar affective 
disorder, e.g., severe bipolar affective (mood) disorder (BP-1), and bipolar affective 
neurological disorders, e.g., migraine and obesity. Further CNS-related disorders 
include, for example, those listed in the American Psychiatric Association's Diagnostic 
and Statistical manual of Mental Disorders (DSM), the most current version of which is 

35 incorporated herein by reference in its entirety. 

Ion channel associated disorders, e.g., vanilloid receptor associated disorders 
also include pain disorders. As used herein, the term "pain disorders" includes those 
disorders, diseases or conditions that affect pain signaling mechanisms. Examples of 
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pain disorders include postherpetic neuralgia, diabetic neuropathy, postmastectomy pain 
syndrome, stump pain, reflex sympathetic dystrophy, trigeminal neuralgia, neuropathic 
pain, orofacial neuropathic pain, osteoarthritis, rheumatoid arthritis, fibromyalgia 
syndrome, tension myalgia, Guillian-Barre syndrome, Meralgia paraesthetica, burning 
5 mouth syndrome, fibrocitis, myofascial pain syndrome, idiopathic pain disorder, 
temporomandibular joint syndrome, atypical odontalgia, loin pain, haematuria 
syndrome, non-cardiac chest pain, low back pain, chronic nonspecific pain, psychogenic 
pain, musculoskeletal pain disorder, chronic pelvic pain, nonorganic chronic headache, 
tension-type headache, cluster headache, migraine, complex regional pain syndrome, 

10 vaginismus, nerve trunk pain, somatoform pain disorder, cyclical mastalgia, chronic 
fatigue syndrome, multiple somatization syndrome, chronic pain disorder, somatization 
disorder, Syndrome X, facial pain, idiopathic pain disorder, posttraumatic rheumatic 
pain modulation disorder (fibrositis syndrome), and Tangier disease. 

As used herein, the term "pain signaling mechanisms" include the cellular 

1 5 mechanisms involved in the development and regulation of pain, e.g. , pain elicited by 
noxious chemical, mechanical, or thermal stimuli, in a subject, e.g., a mammal such as a 
human. In mammals, the initial detection of noxious chemical, mechanical, or thermal 
stimuli, a process referred to as "nociception", occurs predominantly at the peripheral 
terminals of specialized, small diameter sensory neurons. These sensory neurons 

20 transmit the information to the central nervous system, evoking a perception of pain or 
discomfort and initiating appropriate protective reflexes. The TLCC-4 or TLCC-5 
molecules of the present invention may be present on these sensory neurons and, thus, 
may be involved in detecting these noxious chemical, mechanical, or thermal stimuli 
and transducing this information into membrane depolarization events. Thus, the 

25 TLCC-4 or TLCC-5 molecules by participating in pain signaling mechanisms, may 
modulate pain elicitation and act as targets for developing novel diagnostic targets and 
therapeutic agents to control pain. 

Ion channel associated disorders, e.g., calcium channel and/or vanilloid receptor 
disorders, also include cellular proliferation, growth, differentiation, or migration 

30 disorders. Cellular proliferation, growth, differentiation, or migration disorders include 
those disorders that affect cell proliferation, growth, differentiation, or migration 
processes. As used herein, a "cellular proliferation, growth, differentiation, or migration 
process" is a process by which a cell increases in number, size or content, by which a 
cell develops a specialized set of characteristics which differ from that of other cells, or 

35 by which a cell moves closer to or further from a particular location or stimulus. The 
TLCC-4 or TLCC-5 molecules of the present invention are involved in signal 
transduction mechanisms, which are known to be involved in cellular growth, 
differentiation, and migration processes. Thus, the TLCC-4 or TLCC-5 molecules may 
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modulate cellular growth, differentiation, or migration, and may play a role in disorders 
characterized by aberrantly regulated growth, differentiation, or migration. Such 
disorders include cancer, e.g., carcinoma, sarcoma, or leukemia; tumor angiogenesis and 
metastasis; skeletal dysplasia; neuronal deficiencies resulting from impaired neural 
5 induction and patterning; hepatic disorders; cardiovascular disorders; and hematopoietic 
and/or myeloproliferative disorders. 

As used herein, an "ion channel mediated activity" includes an activity which 
involves an ion channel, e.g., an ion channel and/or a vanilloid receptor, in a neuronal 
cell, a muscular cell, a skin cell or a liver cell, associated with receiving, conducting, 

10 and transmitting signals. Ion channel mediated activities {e.g. , calcium channel and/or 
vanilloid receptor mediated activities) include release of neurotransmitters or second 
messenger molecules {e.g., dopamine or norepinephrine), from cells, e.g., neuronal cells; 
modulation of resting potential of membranes, wave forms and frequencies of action 
potentials, and thresholds of excitation; participation in signal transduction pathways, 

15 and modulation of processes such as integration of sub-threshold synaptic responses and 
the conductance of back-propagating action potentials in, for example, neuronal cells 
{e.g., changes in those action potentials resulting in a morphological or differentiative 
response in the cell). 

The term "family" when referring to the polypeptide and nucleic acid molecules 

20 of the invention is intended to mean two or more polypeptides or nucleic acid molecules 
having a common structural domain or motif and having sufficient amino acid or 
nucleotide sequence homology as defined herein. Such family members can be 
naturally or non-naturally occurring and can be from either the same or different 
species. For example, a family can contain a first protein of human origin, as well as 

25 other, distinct proteins of human origin or alternatively, can contain homologues of non- 
human origin, e.g., monkey proteins. Members of a family may also have common 
functional characteristics. 

For example, the family of TLCC-4 and TLCC-5 polypeptides comprise at least 
one "transmembrane domain." As used herein, the term "transmembrane domain" 

30 includes an amino acid sequence of about 20-45 amino acid residues in length which 
spans the plasma membrane. More preferably, a transmembrane domain includes about 
at least 20, 25, 30, 35, 40, or 45 amino acid residues and spans the plasma membrane. 
Transmembrane domains are rich in hydrophobic residues, and typically have an alpha- 
helical structure. In a preferred embodiment, at least 50%, 60%, 70%, 80%, 90%, 95% 

35 or more of the amino acids of a transmembrane domain are hydrophobic, e.g., leucines, 
isoleucines, alanines, valines, phenylalanines, prolines or methionines. Transmembrane 
domains are described in, for example, Zagotta W.N. et al y (1996) Annual Rev. 
Neurosci. 19: 235-263, the contents of which are incorporated herein by reference. 
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Amino acid residues 440-461, 488-508, 520-540, 547-565, 590-609, and 652-676 of the 
human TLCC-4 polypeptide comprise transmembrane domains (see, e.g., Figures 4 and 
6). Amino acid residues 786-803 and 826-848 of the human TLCC-5 polypeptide 
comprise transmembrane domains (see Figures 7 and 9). Accordingly, TLCG-4 and/or 
5 TLCC-5 polypeptides having at least 50-60% homology, preferably about 60-70%, 
more preferably about 70-80%, or about 80-90% homology with a transmembrane 
domain of human TLCC-4 and/or TLCC-5 are within the scope of the invention. 

In another embodiment, a TLCC-4 molecule of the present invention is identified 
based on the presence of at least one "pore domain" between the fifth and sixth 

10 transmembrane domains. As used herein, the term "pore domain" includes an overall 
hydrophobic amino acid sequence which is located between two transmembrane 
domains of a calcium channel protein, preferably transmembrane domains 5 and 6, and 
which is believed to be a major determinant of ion selectivity and channel activity in 
calcium channels. Pore domains are described in, for example Vannier et aL (1998) J. 

15 Biol. Chem. 273: 8675-8679 and Phillips, A. M. et al (1992) Neuron 8, 631-642, the 
contents of which are incorporated herein by reference. TLCC-4 molecules having at 
least one pore domain are within the scope of the invention. Amino acid residues 620- 
640 of the human TLCC-4 sequence (SEQ ID NO:2) comprise a pore domain (Figure 4). 
In another embodiment, a TLCC-4 molecule of the present invention is identified 

20 based on the presence of at least one "ankyrin repeat domain." As used herein, the term 
"ankyrin repeat domain" includes an amino acid sequence of about 10-110 amino acid 
residues which serves as an ankyrin repeat. Preferably, an ankyrin repeat domain 
includes at least about 30 amino acid residues. To identify the presence of an ion 
transport domain in a TLCC-4 protein, and make the determination that a protein of 

25 interest has a particular profile, the amino acid sequence of the protein may be searched 
against a database of known protein domains (e.g., the HMM database). The ankyrin 
repeat domain (HMM) has been assigned the PFAM Accession PF00023 (at the PFAM 
website). A search was performed against the HMM database resulting in the 
identification of ankyrin repeat domains in the amino acid sequence of human TLCC-4 

30 (SEQ ID NO:2) at about residues 167-202, 214-246, 261-294, and 340-372 of SEQ ID 
NO:2. The results of the search are set forth in Figure 5. 

In another embodiment, a TLCC-4 molecule of the present invention is identified 
based on the presence of at least one "ion transport protein domain." As used herein, the 
term "ion transport protein domain" includes a protein domain having an amino acid 

35 sequence of about 100-200 amino acid residues which serves to transport ions. 
Preferably, an ion transport protein domain includes at least about 160 amino acid 
residues. To identify the presence of an ion transport protein domain in a TLCC-4 
protein, and make the determination that a protein of interest has a particular profile, the 

10 
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amino acid sequence of the protein may be searched against a database of known protein 
domains (e.g., the HMM database). The ion transport domain (HMM) has been 
assigned the PFAM Accession PF00520 (at the world wide web address: 
genome.wustl.edu/Pfam/html). A search was performed against the HMM database 
5 resulting in the identification of an ion transport protein domain in the amino acid 
sequence of human TLCC-4 (SEQ ID NO:2) at about residues 510-677 of SEQ ID 
NO:2. The results of the search are set forth in Figure 5. 

In another embodiment, a TLCC-5 molecule of the present invention is identified 
based on the presence of at least one "transient receptor domain." As used herein, the 

10 term "transient receptor domain" includes a protein domain having an amino acid 
sequence of about 100-200 amino acid residues which is found in transient receptor 
potential (Trp) proteins and related ion channel proteins. Preferably, a transient receptor 
domain includes at least about 56-58 amino acid residues. To identify the presence of a 
transient receptor domain in a TLCC-5 protein, and make the determination that a 

15 protein of interest has a particular profile, the amino acid sequence of the protein may be 
searched against a database of known protein domains (e.g., the HMM database). The 
transient receptor domain (HMM) has been assigned the PFAM Accession PF02164 (at 
the world wide web address: genome.wustl.edu/Pfam/html). A search was performed 
against the HMM database resulting in the identification of transient receptor domains in 

20 the amino acid sequence of human TLCC-5 (SEQ ID NO:5) at about residues 720-778 
and 820-876 of SEQ ID NO:5. The results of the search are set forth in Figure 8. 

A description of the Pfam database can be found in Sonhammer et ah (1997) 
Proteins 28:405-420 and a detailed description of HMMs can be found, for example, in 
Gribskov et al(\990) Meth Enzymol 183:146-159; Gribskov et a/.(1987) Proc. Natl 

25 Acad ScL USA 84:4355-4358; Krogh et #7.(1994)./ Mol Biol 235:1501-1531; and 
Stultz et al (1993) Protein Sci. 2:305-314, the contents of which are incorporated herein 
by reference. 

In a preferred embodiment, the TLCC-4 or TLCC-5 molecules of the invention include 
at least one transmembrane domain, at least one ankyrin repeat domain, at least one pore 
30 domain, at least one transient receptor domain, and/or at least one ion transport protein 
domain. 

In a preferred embodiment, a TLCC-4 or TLCC-5 polypeptide includes at least 
one or more of the following domains: an ankyrin repeat domain, and/or a 
transmembrane domain, and/or a pore domain, and/or a transient receptor domain, 
35 and/or an ion transport protein domain, and has an amino acid sequence at least about 
50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or more 
homologous or identical to the amino acid sequence of SEQ ID NO:2 or 5, or the amino 
acid sequences encoded by the DNA inserts of the plasmids deposited with ATCC as 

ll 

BNSDOCID: <WO 0200722A2_I_> 



WO 02/00722 



PCT/US01/20640 



Accession Numbers . In yet another preferred embodiment, a TLCC-4 or TLCC-5 

polypeptide includes at least one or more of the following domains: an ankyrin repeat 
domain, and/or a transmembrane domain, and/or a pore domain, and/or a transient 
receptor domain, and/or an ion transport protein domain, and is encoded by a nucleic 
5 acid molecule having a nucleotide sequence which hybridizes under stringent 

hybridization conditions to a complement of a nucleic acid molecule comprising the 
nucleotide sequence of SEQ ID NO:l, 3, 4, or 6. In another preferred embodiment, a 
TLCC-4 or TLCC-5 polypeptide includes at least one or more of the following 
domains: an ankyrin repeat domain, and/or a transmembrane domain, and/or a pore 

10 domain, and/or a transient receptor domain, and/or an ion transport protein domain, and 
has a TLCC-4 or TLCC-5 activity. 

As used interchangeably herein, a "TLCC-4 or TLCC-5 activity", "biological 
activity of TLCC-4 or TLCC-5 " or "functional activity of TLCC-4 or TLCC-5", refers 
to an activity exerted by a TLCC-4 or TLCC-5 polypeptide or nucleic acid molecule on 

15 a TLCC-4 or TLCC-5 responsive cell or tissue, or on a TLCC-4 or TLCC-5 polypeptide 
substrate, as determined in vivo, or in vitro, according to standard techniques. In one 
embodiment, a TLCC-4 or TLCC-5 activity is a direct activity, such as an association 
with a TLCC-4-target molecule or TLCC-5-target molecule. As used herein, a "target 
molecule" or "binding partner" is a molecule with which a TLCC-4 or TLCC-5 

20 polypeptide binds or interacts in nature, such that TLCC-4-mediated or TLC C-5 - 
mediated function is achieved. A TLCC-4 or TLCC-5 target molecule can be a non- 
TLCC-4 or non-TLCC-5 molecule or a TLCC-4 or TLCC-5 polypeptide or polypeptide 
of the present invention. In an exemplary embodiment, a TLCC-4 or TLCC-5 target 
molecule is a TLCC-4 or TLCC-5 ligand, e.g., a calcium channel ligand such as 

25 calcium. Alternatively, a TLCC-4 or TLCC-5 activity is an indirect activity, such as a 
cellular signaling activity mediated by interaction of the TLCC-4 or TLCC-5 
polypeptide with a TLCC-4 or TLCC-5 ligand. The biological activities of TLCC-4 or 
TLCC-5 are described herein. For example, the TLCC-4 or TLCC-5 polypeptides of the 
present invention can have one or more of the following activities: (1) modulate 

30 membrane excitability, (2) influence the resting potential of membranes, (3) modulate 
wave forms and frequencies of action potentials, (4) modulate thresholds of excitation, 
(5) modulate neurite outgrowth and synaptogenesis, (6) modulate signal transduction, 
and (7) participate in nociception. 

Accordingly, another embodiment of the invention features isolated TLCC-4 or 

35 TLCC-5 polypeptides and polypeptides having a TLCC-4 or TLCC-5 activity. Preferred 
polypeptides are TLCC-4 or TLCC-5 polypeptides having at least one or more of the 
following domains: an ankyrin repeat domain, and/or a transmembrane domain, and/or a 
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pore domain, and/or a transient receptor domain, and/or an ion transport protein domain 
and, preferably, a TLCC-4 or TLCC-5 activity. 

Additional preferred polypeptides have one or more of the following domains: an 
ankyrin repeat domain, a transmembrane domain, a pore domain, a transient receptor 
5 domain, and/or an ion transport protein domain, and are, preferably, encoded by a 
nucleic acid molecule having a nucleotide sequence which hybridizes under stringent 
hybridization conditions to a nucleic acid molecule comprising the nucleotide sequence 
ofSEQ ID NO: 1,3, 4, or 6. 

The nucleotide sequence of the isolated human TLCC-4 and TLCC-5 cDNA and 
10 the predicted amino acid sequences of the human TLCC-4 and TLCC-5 polypeptides are 
shown in Figures 1 and 2, and in SEQ ID NOs: 1 and 2, and 3 and 4, respectively. 
Plasmids containing the nucleotide sequences encoding human TLCC-4 and TLCC-5 
were deposited with the American Type Culture Collection (ATCC), 10801 University 

Boulevard, Manassas, VA 201 10-2209, on and assigned Accession Numbers 

15 . These deposits will be maintained under the terms of the Budapest Treaty on 

the International Recognition of the Deposit of Microorganisms for the Purposes of 
Patent Procedure. These deposits were made merely as a convenience for those of skill 
in the art and are not an admission that a deposit is required under 35 U.S.C. §1 12. 

The human TLCC-4 gene, which is approximately 3321 nucleotides in length, 
20 encodes a polypeptide which is approximately 751 amino acid residues in length. The 
human TLCC-5 gene, which is approximately 3042 nucleotides in length, encodes a 
polypeptide which is approximately 1013 amino acid residues in length. 

Various aspects of the invention are described in further detail in the following 
subsections: 

25 

L Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to isolated nucleic acid molecules that 
encode TLCC-4 or TLCC-5 polypeptides or biologically active portions thereof, as well 
as nucleic acid fragments sufficient for use as hybridization probes to identify TLCC-4- 

30 encoding or TLCC-5-encoding nucleic acid molecules (e.g. , TLCC-4 or TLCC-5 
mRNA) and fragments for use as PCR primers for the amplification or mutation of 
TLCC-4 or TLCC-5 nucleic acid molecules. As used herein, the term "nucleic acid 
molecule" is intended to include DNA molecules {e.g., cDNA or genomic DNA) and 
RNA molecules (e.g., mRNA) and analogs of the DNA or RNA generated using 

35 nucleotide analogs. The nucleic acid molecule can be single-stranded or double- 
stranded, but preferably is double-stranded DNA. 
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The term "isolated nucleic acid molecule" includes nucleic acid molecules which 
are separated from other nucleic acid molecules which are present in the natural source 
of the nucleic acid. For example, with regards to genomic DNA, the term "isolated" 
includes nucleic acid molecules which are separated from the chromosome with which 
5 the genomic DNA is naturally associated. Preferably, an "isolated" nucleic acid is free 
of sequences which naturally flank the nucleic acid (i.e., sequences located at the 5' and 
3* ends of the nucleic acid) in the genomic DNA of the organism from which the nucleic 
acid is derived. For example, in various embodiments, the isolated TLCC-4 or TLCC-5 
nucleic acid molecule can contain less than about 5 kb, 4kb, 3kb, 2kb, 1 kb, 0.5 kb or 0.1 

10 kb of nucleotide sequences which naturally flank the nucleic acid molecule in genomic 
DNA of the cell from which the nucleic acid is derived. Moreover, an "isolated" nucleic 
acid molecule, such as a cDNA molecule, can be substantially free of other cellular 
material, or culture medium when produced by recombinant techniques, or substantially 
free of chemical precursors or other chemicals when chemically synthesized. 

15 A nucleic acid molecule of the present invention, e.g., a nucleic acid molecule 

having the nucleotide sequence of SEQ ID NO:l, 3, 4, or 6, or the nucleotide sequences 

of the DNA inserts of the plasmids deposited with ATCC as Accession Numbers , 

or a portion thereof, can be isolated using standard molecular biology techniques and the 
sequence information provided herein. Using all or a portion of the nucleic acid 

20 sequence of SEQ ID NO:l, 3, 4, or 6, or the nucleotide sequences of the DNA inserts of 

the plasmids deposited with ATCC as Accession Numbers , as a hybridization 

probe, TLCC-4 or TLCC-5 nucleic acid molecules can be isolated using standard 
hybridization and cloning techniques (e.g., as described in Sambrook, J., Fritsh, E. F., 
and Maniatis, T. Molecular Cloning: A Laboratory Manual. 2nd, ed. t Cold Spring 

25 Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
1989). 

Moreover, a nucleic acid molecule encompassing all or a portion of SEQ ID 
NO:l, 3, 4, or 6, or the nucleotide sequences of the DNA inserts of the plasmids 
deposited with ATCC as Accession Numbers can be isolated by the polymerase 

30 chain reaction (PCR) using synthetic oligonucleotide primers designed based upon the 
sequence of SEQ ID NO: 1, 3, 4, or 6, or the nucleotide sequences of the DNA inserts of 

the plasmids deposited with ATCC as Accession Numbers . 

A nucleic acid of the invention can be amplified using cDNA, mRNA or 
alternatively, genomic DNA, as a template and appropriate oligonucleotide primers 

35 according to standard PCR amplification techniques. The nucleic acid so amplified can 
be cloned into an appropriate vector and characterized by DNA sequence analysis. 
Furthermore, oligonucleotides corresponding to TLCC-4 or TLCC-5 nucleotide 
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sequences can be prepared by standard synthetic techniques, e.g., using an automated 
DNA synthesizer. 

In one embodiment, an isolated nucleic acid molecule of the invention comprises 
the nucleotide sequence shown in SEQ ID NO:l or SEQ ID NO:4. The sequence of 
5 SEQ ID NO: 1 corresponds to the human TLCC-4. This cDNA comprises sequences 
encoding the human TLCC-4 polypeptide (i.e., "the coding region", from nucleotides 
146-2368) as well as 5 1 untranslated sequences (nucleotides 1-145) and 3 1 untranslated 
sequences (nucleotides 2369-4586). Alternatively, the nucleic acid molecule can 
comprise only the coding region of SEQ ID NO:l (e.g., nucleotides 146-2368, 

10 corresponding to SEQ ID NO:3). Accordingly, in another embodiment, the isolated 

nucleic acid molecule comprises SEQ ID NO:3 and nucleotides 1-145 and 2369-4586 of 
SEQ ID NO: 1 . The sequence of SEQ ID NO:4 corresponds to the human TLCC-5. 
This cDNA comprises sequences encoding the human TLCC-5 polypeptide (i.e., "the 
coding region", from nucleotides 1-3042). In yet another embodiment, the nucleic acid 

15 molecule consists of the nucleotide sequence set forth as SEQ ID NO: 1, 3, 4, or 6. 

In still another embodiment, an isolated nucleic acid molecule of the invention 
comprises a nucleic acid molecule which is a complement of the nucleotide sequence 
shown in SEQ ID NO:l ? 3, 4, or 6, or the nucleotide sequences of the DNA inserts of the 
plasmids deposited with ATCC as Accession Numbers , or a portion of any of 

20 these nucleotide sequences. A nucleic acid molecule which is complementary to the 
nucleotide sequence shown in SEQ ID NO:l , 3, 4, or 6, or the nucleotide sequences of 

the DNA inserts of the plasmids deposited with ATCC as Accession Numbers , is 

one which is sufficiently complementary to the nucleotide sequence shown in SEQ ID 
NO.T, 3, 4, or 6, or the nucleotide sequences of the DNA inserts of the plasmids 

25 deposited with ATCC as Accession Numbers , such that it can hybridize to the 

nucleotide sequence shown in SEQ ID NO:l, 3, 4, or 6, or the nucleotide sequences of 

the DNA inserts of the plasmids deposited with ATCC as Accession Numbers , 

thereby forming a stable duplex. 

In still another preferred embodiment, an isolated nucleic acid molecule of the 

30 present invention comprises a nucleotide sequence which is at least about 50%, 55%, 
60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or more identical 
to the entire length of the nucleotide sequence shown in SEQ ID NO:l, 3, 4, or 6, (e.g., 
the entire length of the nucleotide sequence), or to the nucleotide sequences (e.g., the 
entire length of the nucleotide sequence) of the DNA inserts of the plasmids deposited 

35 with ATCC as Accession Numbers , or to a portion or complement of any of these 

nucleotide sequences. In one embodiment, a nucleic acid molecule of the present 
invention comprises a nucleotide sequence which is at least 50, 100, 150, 200, 250, 300, 
350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1050, 1 100, 
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1150, 1200, 1250, 1300, 1350, 1400, 1450, 1500 or more nucleotides in length and 
hybridizes under stringent hybridization conditions to a complement of a nucleic acid 
molecule of SEQ ID NO:l, 3, 4, or 6 or the nucleotide sequences of the DNA inserts of 

the plasmids deposited with ATCC as Accession Numbers . 

5 Moreover, the nucleic acid molecule of the invention can comprise only a 

portion of the nucleic acid sequence of SEQ ID NO:l, 3, 4, or 6, or the nucleotide 
sequences of the DNA inserts of the plasmids deposited with ATCC as Accession 

Numbers , for example, a fragment which can be used as a probe or primer or a 

fragment encoding a portion of a TLCC-4 or TLCC-5 polypeptide, e.g., a biologically 

10 active portion of a TLCC-4 or TLCC-5 polypeptide. The nucleotide sequence 

determined from the cloning of the TLCC-4 or TLCC-5 gene allows for the generation 
of probes and primers designed for use in identifying and/or cloning other TLCC-4 or 
TLCC-5 family members, as well as TLCC-4 or TLCC-5 homologues from other 
species. The probe/primer typically comprises substantially purified oligonucleotide. 

15 The probe/primer (e.g., oligonucleotide) typically comprises a region of nucleotide 

sequence that hybridizes under stringent conditions to at least about 12 or 15, preferably 
about 20 or 25, more preferably about 30, 35, 40, 45, 50, 55, 60, 65, 75, 80, 85, 90, 95, 
or 100 or more consecutive nucleotides of a sense sequence of SEQ ID NO:l, 3, 4, or 6, 
or the nucleotide sequences of the DNA inserts of the plasmids deposited with ATCC as 

20 Accession Numbers , of an anti-sense sequence of SEQ ID NO:l, 3, 4, or 6, or the 

nucleotide sequences of the DNA inserts of the plasmids deposited with ATCC as 

Accession Numbers j , or of a naturally occurring allelic variant or mutant of SEQ 

ID NO:l, 3, 4, or 6, or the nucleotide sequences of the DNA inserts of the plasmids 
deposited with ATCC as Accession Numbers . 

25 Exemplary probes or primers are at least (or no greater than) 1 2 or 1 5, 20 or 25, 

30, 35, 40, 45, 50, 55, 60, 65, 70, 75 or more nucleotides in length and/or comprise 
consecutive nucleotides of an isolated nucleic acid molecule described herein. Also 
included within the scope of the present invention are probes or primers comprising 
contiguous or consecutive nucleoitdes of an isolated nucleic acid molecule described 

30 herein, but for the difference of 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10 bases within the probe or 
primer sequence. Probes based on the TLCC-4 or TLCC-5 nucleotide sequences can be 
used to detect (e.g., specifically detect) transcripts or genomic sequences encoding the 
same or homologous polypeptides. In preferred embodiments, the probe further 
comprises a label group attached thereto, e.g., the label group can be a radioisotope, a 

35 fluorescent compound, an enzyme, or an enzyme co-factor. In another embodiment a 
set of primers is provided, e.g., primers suitable for use in PCR, which can be used to 
amplify a selected region of a TLCC-4 or TLCC-5 sequence, e.g., a domain, region, site 
or other sequence described herein. The primers should be at least 5, 10, or 50 base 

M 
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pairs in length and less than 100, or less than 200, base pairs in length. The primers 
should be identical, or differ by no greater than 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10 bases when 
compared to a sequence disclosed herein or to the sequence of a naturally occurring 
variant. Such probes can be used as a part of a diagnostic test kit for identifying cells or 
5 tissue which misexpress a TLCC-4 or TLCC-5 polypeptide, such as by measuring a 
level of a TLCC-4-encoding or TLCC-5-encoding nucleic acid in a sample of cells from 
a subject e.g., detecting TLCC-4 or TLCC-5 mRNA levels or determining whether a 
genomic TLCC-4 or TLCC-5 gene has been mutated or deleted. 

A nucleic acid fragment encoding a "biologically active portion of a TLCC-4 or 

10 TLCC-5 polypeptide" can be prepared by isolating a portion of the nucleotide sequence 
of SEQ ID NO:l, 3, 4, or 6, or the nucleotide sequences of the DNA inserts of the 

plasmids deposited with ATCC as Accession Numbers , which encodes a 

polypeptide having a TLCC-4 or TLCC-5 biological activity (the biological activities of 
the TLCC-4 or TLCC-5 polypeptides are described herein), expressing the encoded 

15 portion of the TLCC-4 or TLCC-5 polypeptide (e.g., by recombinant expression in vitro) 
and assessing the activity of the encoded portion of the TLCC-4 or TLCC-5 polypeptide. 
In an exemplary embodiment, the nucleic acid molecule is at least 50-100, 100-250, 
250-500, 500-700, 700-1000, 1000-1250, 1250-1500, 1500-1750, 1750-2000, 2000- 
2250, 2250-2500, 2500-2750, 2750-300 or more nucleotides in length and encodes a 

20 protein having a TLCC-4 or TLCC-5 activity (as described herein). 

The invention further encompasses nucleic acid molecules that differ from the 
nucleotide sequence shown in SEQ ID NO:l, 3, 4, or 6, or the nucleotide sequences of 

the DNA inserts of the plasmids deposited with ATCC as Accession Numbers . 

Such differences can be due to degeneracy of the genetic code, thus resulting in a 

25 nucleic acid which encodes the same TLCC-4 or TLCC-5 polypeptides as those encoded 
by the nucleotide sequence shown in SEQ ID NO:l, 3, 4, or 6, or the nucleotide 
sequences of the DNA inserts of the plasmids deposited with ATCC as Accession 

Numbers . In another embodiment, an isolated nucleic acid molecule of the 

invention has a nucleotide sequence encoding a polypeptide having an amino acid 

30 sequence which differs by at least 1, but no greater than 5, 10, 20, 50 or 100 amino acid 
residues from the amino acid sequence shown in SEQ ID NO:2 or 5, or the amino acid 
sequences encoded by the DNA inserts of the plasmids deposited with the ATCC as 

Accession Numbers . In yet another embodiment, the nucleic acid molecule 

encodes the amino acid sequence of human TLCC-4 or human TLCC-5. If an alignment 

35 is needed for this comparison, the sequences should be aligned for maximum homology. 
Nucleic acid variants can be naturally occurring, such as allelic variants (same 
locus), homologues (different locus), and orthologues (different organism) or can be non 
naturally occurring. Non-naturally occurring variants can be made by mutagenesis 
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techniques, including those applied to polynucleotides, cells, or organisms. The variants 
can contain nucleotide substitutions, deletions, inversions and insertions. Variation can 
occur in either or both the coding and non-coding regions. The variations can produce both 
conservative and non-conservative amino acid substitutions (as compared to the encoded 
5 product). 

Allelic variants result, for example, from DNA sequence polymorphisms within 
a population (e.g., the human population) that lead to changes in the amino acid 
sequences of the TLCC-4 or TLCC-5 polypeptides. Such genetic polymorphisms in the 
TLCC-4 or TLCC-5 genes may exist among individuals within a population due to 

10 natural allelic variation. As used herein, the terms "gene" and "recombinant gene" refer 
to nucleic acid molecules which include an open reading frame encoding a TLCC-4 or 
TLCC-5 polypeptide, preferably a mammalian TLCC-4 or TLCC-5 polypeptide, and can 
further include non-coding regulatory sequences, and introns. 

Accordingly, in one embodiment, the invention features isolated nucleic acid 

1 5 molecules which encode a naturally occurring allelic variant of a polypeptide 

comprising the amino acid sequence of SEQ ID NO:2 or 5, or amino acid sequences 
encoded by the DNA inserts of the plasmids deposited with ATCC as Accession 

Numbers , wherein the nucleic acid molecule hybridizes to a complement of a 

nucleic acid molecule comprising SEQ ID NO:l, 3, 4, or 6, for example, under stringent 

20 hybridization conditions. 

Allelic variants of human TLCC-4 or TLCC-5 include both functional and non- 
functional TLCC-4 or TLCC-5 polypeptides. Functional allelic variants are naturally 
occurring amino acid sequence variants of the human TLCC-4 or TLCC-5 polypeptide 
that maintain the ability to bind a TLCC-4 or TLCC-5 ligand or substrate and/or 

25 modulate membrane excitability or signal transduction. Functional allelic variants will 
typically contain only conservative substitution of one or more amino acids of SEQ ID 
NO:2 or 5, or substitution, deletion or insertion of non-critical residues in non-critical 
regions of the polypeptide. 

Non-functional allelic variants are naturally occurring amino acid sequence 

30 variants of the human TLCC-4 or TLCC-5 polypeptide that do not have the ability to 
form functional calcium channels or to modulate membrane excitability. Non-functional 
allelic variants will typically contain a non-conservative substitution, a deletion, or 
insertion or premature truncation of the amino acid sequence of SEQ ID NO:2 or 5, or a 
substitution, insertion or deletion in critical residues or critical regions. 

35 The present invention further provides non-human orthologues (e.g. , non-human 

orthologues of the human TLCC-4 or TLCC-5 polypeptides). Orthologues of the human 
TLCC-4 or TLCC-5 polypeptide are polypeptides that are isolated from non-human 
organisms and possess the same TLCC-4 or TLCC-5 ligand binding and/or modulation 
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of membrane excitation mechanisms of the human TLCC-4 or TLCC-5 polypeptide. 
Orthologues of the human TLCC-4 or TLCC-5 polypeptide can readily be identified as 
comprising an amino acid sequence that is substantially identical to SEQ ID NO:2 or 5. 
Moreover, nucleic acid molecules encoding other TLCC-4 or TLCC-5 family 
5 members and, thus, which have a nucleotide sequence which differs from the TLCC-4 or 
TLCC-5 sequences of SEQ ID NO:l, 3, 4, or 6, or the nucleotide sequences of the DNA 

inserts of the plasmids deposited with ATCC as Accession Numbers are intended 

to be within the scope of the invention. For example, another TLCC-4 or TLCC-5 
cDNA can be identified based on the nucleotide sequence of human TLCC-4 or TLCC- 

10 5. Moreover, nucleic acid molecules encoding TLCC-4 or TLCC-5 polypeptides from 
different species, and which, thus, have a nucleotide sequence which differs from the 
TLCC-4 or TLCC-5 sequences of SEQ ID NO:l, 3, 4, or 6, or the nucleotide sequences 

of the DNA inserts of the plasmids deposited with ATCC as Accession Numbers 

are intended to be within the scope of the invention. For example, a mouse TLCC-4 or 

15 TLCC-5 cDNA can be identified based on the nucleotide sequence of a human TLCC-4 
or TLCC-5. 

Nucleic acid molecules corresponding to natural allelic variants and homologues 
of the TLCC-4 or TLCC-5 cDNAs of the invention can be isolated based on their 
homology to the TLCC-4 or TLCC-5 nucleic acids disclosed herein using the cDNAs 

20 disclosed herein, or a portion thereof, as a hybridization probe according to standard 
hybridization techniques under stringent hybridization conditions. Nucleic acid 
molecules corresponding to natural allelic variants and homologues of the TLCC-4 or 
TLCC-5 cDNAs of the invention can further be isolated by mapping to the same 
chromosome or locus as the TLCC-4 or TLCC-5 gene. 

25 Orthologues, homologues and allelic variants can be identified using methods 

known in the art (e.g., by hybridization to an isolated nucleic acid molecule of the 
present invention, for example, under stringent hybridization conditions). In one 
embodiment, an isolated nucleic acid molecule of the invention is at least 15, 20, 25, 30 
or more nucleotides in length and hybridizes under stringent conditions to the nucleic 

30 acid molecule comprising the nucleotide sequence of SEQ ID NO:l, 3, 4, or 6, or the 
nucleotide sequences of the DNA inserts of the plasmids deposited with ATCC as 

Accession Numbers . In another embodiment, the nucleic acid is at least 100, 

100-150, 150-200, 200-250, 250-300, 300-350, 350-400, 400-450, 450-500, 500-550, 
550-600, 600-650, 650-700, 700-750, 750-800, 800-850, 850-900, 900-950, 950-1000, 

35 1000-1050, 1050-1070, 1070-1100, 1100-1150, 1150-1200, 1200-1250, 1250-1300, 
1300-1350, 1350-1400, 1400-1450, 1450-1500, 1500-1550, 1550-1600, 1600-1650, 
1650-1700, 1700-1750, 1750-1800, 1800-1850, 1850-1900, 1900-1950, 1950-2000, 
2000-2050, 2050-2100, 2100-2150, 2150-2200, 2200-2250, 2250-2300, 2300-2350, 
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2350-2400, 2400-2450, 2450-2500, 2500-2550, 2550-2600, 2600-2650, 2650-2700, 
2700-2750, 2750-2800, 2800-2850, 2850-2900, 2900-2950, 2950-3000, 3000-3050, 
3050-3100, 3100-3150, 3150-3200, 3200-3250, 3250-3300, 3300-3350, 3350-3400, 
3400-3450, 3450-3500, 3500-3550, 3550-3600, 3600-3650, 3650-3700, 3700-3750, 
5 3750-3800, 3800-3850, 3850-3900, 3900-3950, 3950-4000, 4000-4050, 4050-4100, 
4100-4150, 4150-4200, 4200-4250, 4250-4300, 4300-4350, 4350-4400, 4400-4450, 
4450-4500 or more nucleotides in length. 

As used herein, the term "hybridizes under stringent conditions" is intended to 
describe conditions for hybridization and washing under which nucleotide sequences 

10 that are significantly identical or homologous to each other remain hybridized to each 
other. Preferably, the conditions are such that sequences at least about 70%, more 
preferably at least about 80%, even more preferably at least about 85% or 90% identical 
to each other remain hybridized to each other. Such stringent conditions are known to 
those skilled in the art and can be found in Current Protocols in Molecular Biology, 

15 Ausubel et al, eds., John Wiley & Sons, Inc. (1995), sections 2, 4 and 6. Additional 
stringent conditions can be found in Molecular Cloning: A Laboratory Manual, 
Sambrook et al 9 Cold Spring Harbor Press, Cold Spring Harbor, NY (1989), chapters 7, 
9 and 1 1 . A preferred, non-limiting example of stringent hybridization conditions 
includes hybridization in 4X sodium chloride/sodium citrate (SSC), at about 65-70°C (or 

20 hybridization in 4X SSC plus 50% formamide at about 42-50°C) followed by one or 

more washes in IX SSC, at about 65-70°C. A preferred, non-limiting example of highly 
stringent hybridization conditions includes hybridization in IX SSC, at about 65-70°C 
(or hybridization in IX SSC plus 50% formamide at about 42-50°C) followed by one or 
more washes in 0.3X SSC, at about 65-70°C. A preferred, non-limiting example of 

25 reduced stringency hybridization conditions includes hybridization in 4X SSC, at about 
50-60°C (or hybridization in 6X SSC plus 50% formamide at about 40-45°C) followed 
by one or more washes in 2X SSC, at about 50-60°C. Ranges intermediate to the above- 
recited values, e.g. , at 65-70°C or at 42-50°C are also intended to be encompassed by 
the present invention. SSPE (lxSSPE is 0.15MNaCl, lOmM NaH 2 P0 4 , and 1.25mM 

30 EDTA, pH 7.4) can be substituted for SSC (IxSSC is 0.1 5M NaCl and 1 5mM sodium 
citrate) in the hybridization and wash buffers; washes are performed for 15 minutes each 
after hybridization is complete. The hybridization temperature for hybrids anticipated to 
be less than 50 base pairs in length should be 5-10°C less than the melting temperature 
(T m ) of the hybrid, where T m is determined according to the following equations. For 

35 hybrids less than 1 8 base pairs in length, T m (°C) = 2(# of A + T bases) + 4(# of G + C 
bases). For hybrids between 18 and 49 base pairs in length, T m (°C) = 81.5 + 
16.6(logio[Na + ]) + 0.41 (%G+C) - (600/N), where N is the number of bases in the hybrid, 
and [Na + ] is the concentration of sodium ions in the hybridization buffer ([Na + ] for 

20 
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lxSSC = 0.165 M). It will also be recognized by the skilled practitioner that additional 
reagents may be added to hybridization and/or wash buffers to decrease non-specific 
hybridization of nucleic acid molecules to membranes, for example, nitrocellulose or 
nylon membranes, including but not limited to blocking agents (e.g., BSA or salmon or 
5 herring sperm carrier DNA), detergents (e.g., SDS), chelating agents (e.g., EDTA), 
Ficoll, PVP and the like. When using nylon membranes, in particular, an additional 
preferred, non-limiting example of stringent hybridization conditions is hybridization in 
0.25-0.5M NaH 2 P0 4 , 7% SDS at about 65°C, followed by one or more washes at 0.02M 
NaH 2 P0 4 , 1% SDS at 65°C, see, e.g., Church and Gilbert (1984) Proc. Natl. Acad. Set 

10 USA 81:1991-1995, (or alternatively 0.2XSSC, 1% SDS). 

Preferably, an isolated nucleic acid molecule of the invention that hybridizes 
under stringent conditions to the sequence of SEQ ID NO:l, 3, 4, or 6 and corresponds 
to a naturally-occurring nucleic acid molecule. As used herein, a "naturally-occurring 11 
nucleic acid molecule refers to an RNA or DNA molecule having a nucleotide sequence 

1 5 that occurs in nature (e.g. , encodes a natural polypeptide). 

In addition to naturally-occurring allelic variants of the TLCC-4 or TLCC-5 
sequences that may exist in the population, the skilled artisan will further appreciate that 
changes can be introduced by mutation into the nucleotide sequences of SEQ ID NO:l, 3, 
4, or 6, or the nucleotide sequences of the DNA inserts of the plasmids deposited with 

20 ATCC as Accession Numbers , thereby leading to changes in the amino acid 

sequence of the encoded TLCC-4 or TLCC-5 polypeptides, without altering the 
functional ability of the TLCC-4 or TLCC-5 polypeptides. For example, nucleotide 
substitutions leading to amino acid substitutions at "non-essential" amino acid residues 
can be made in the sequence of SEQ ID NO:l, 3, 4, or 6, or the nucleotide sequences of 

25 the DNA inserts of the plasmids deposited with ATCC as Accession Numbers . A 

"non-essential" amino acid residue is a residue that can be altered from the wild-type 
sequence of TLCC-4 or TLCC-5 (e.g., the sequence of SEQ ID NO:2 or 5) without 
altering the biological activity, whereas an "essential" amino acid residue is required for 
biological activity. For example, amino acid residues that are conserved among the 

,30 TLCC-4 or TLCC-5 polypeptides of the present invention, e.g., those present in a 
transmembrane domain, are predicted to be particularly unamenable to alteration. 
Furthermore, additional amino acid residues that are conserved between the TLCC-4 or 
TLCC-5 polypeptides of the present invention and other members of the TLCC-4 or 
TLCC-5 family are not likely to be amenable to alteration. 

35 Accordingly, another aspect of the invention pertains to nucleic acid molecules 

encoding TLCC-4 or TLCC-5 polypeptides that contain changes in amino acid residues 
that are not essential for activity. Such TLCC-4 or TLCC-5 polypeptides differ in amino 
acid sequence from SEQ ID NO:2 or 5, yet retain biological activity. In one 
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embodiment, the isolated nucleic acid molecule comprises a nucleotide sequence 
encoding a polypeptide, wherein the polypeptide comprises an amino acid sequence at 
least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 
99% or more identical to SEQ ID NO:2 or 5 (e.g., to the entire length of SEQ ID NO:2 or 
5 5). 

An isolated nucleic acid molecule encoding a TLCC-4 or TLCC-5 polypeptide 
identical to the polypeptide of SEQ ID NO:2 or 5, can be created by introducing one or 
more nucleotide substitutions, additions or deletions into the nucleotide sequence of SEQ 
ID NO:l, 3, 4, or 6, or the nucleotide sequences of the DNA inserts of the plasmids 

10 deposited with ATCC as Accession Numbers , such that one or more amino acid 

substitutions, additions or deletions are introduced into the encoded polypeptide. 
Mutations can be introduced into SEQ ID NO:l, 3, 4, or 6, or the nucleotide sequences of 

the DNA inserts of the plasmids deposited with ATCC as Accession Numbers by 

standard techniques, such as site-directed mutagenesis and PCR-mediated mutagenesis. 

15 Preferably, conservative amino acid substitutions are made at one or more predicted non- 
essential amino acid residues. A "conservative amino acid substitution 11 is one in which 
the amino acid residue is replaced with an amino acid residue having a similar side chain. 
Families of amino acid residues having similar side chains have been defined in the art. 
These families include amino acids with basic side chains (e.g., lysine, arginine, 

20 histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side 
chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine, 
tryptophan), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline, 
phenylalanine, methionine), beta-branched side chains (e.g., threonine, valine, isoleucine) 
and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). Thus, a 

25 predicted nonessential amino acid residue in a TLCC-4 or TLCC-5 polypeptide is 
preferably replaced with another amino acid residue from the same side chain family. 
Alternatively, in another embodiment, mutations can be introduced randomly along all or 
part of a TLCC-4 or TLCC-5 coding sequence, such as by saturation mutagenesis, and 
the resultant mutants can be screened for TLCC-4 or TLCC-5 biological activity to 

30 identify mutants that retain activity. Following mutagenesis of SEQ ID NO:l, 3, 4, or 6, 
or the nucleotide sequences of the DNA inserts of the plasmids deposited with ATCC as 

Accession Numbers , the encoded polypeptide can be expressed recombinantly and 

the activity of the polypeptide can be determined. 

In a preferred embodiment, a mutant TLCC-4 or TLCC-5 polypeptide can be 

35 assayed for the ability to (1) modulate membrane excitability, (2) influence the resting 
potential of membranes, (3) modulate wave forms and frequencies of action potentials, 
(4) modulate thresholds of excitation, (5) modulate neurite outgrowth and 
synaptogenesis, (6) modulate signal transduction, and (7) participate in nociception. 
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In addition to the nucleic acid molecules encoding TLCC-4 or TLCC-5 
polypeptides described above, another aspect of the invention pertains to isolated nucleic 
acid molecules which are antisense thereto. In an exemplary embodiment, the invention 
provides an isolated nucleic acid molecule which is antisense to a TLCC-4 or TLCC-5 
5 nucleic acid molecule (e.g., is antisense to the coding strand of a TLCC-4 or TLCC-5 
nucleic acid molecule). An "antisense" nucleic acid comprises a nucleotide sequence 
which is complementary to a "sense" nucleic acid encoding a polypeptide, e.g., 
complementary to the coding strand of a double-stranded cDNA molecule or 
complementary to an mRNA sequence. Accordingly, an antisense nucleic acid can 

10 hydrogen bond to a sense nucleic acid. The antisense nucleic acid can be complementary 
to an entire TLCC-4 or TLCC-5 coding strand, or to only a portion thereof. In one 
embodiment, an antisense nucleic acid molecule is antisense to a "coding region" of the 
coding strand of a nucleotide sequence encoding TLCC-4 or TLCC-5. The term "coding 
region" refers to the region of the nucleotide sequence comprising codons which are 

15 translated into amino acid residues (e.g., the coding region of human TLCC-4 or TLCC-5 
corresponds to SEQ ID NO:3 or SEQ ID NO:6, respectively). In another embodiment, 
the antisense nucleic acid molecule is antisense to a "noncoding region" of the coding 
strand of a nucleotide sequence encoding TLCC-4 or TLCC-5. The term "noncoding 
region" refers to 5' and 3 1 sequences which flank the coding region that are not translated 

20 into amino acids (i.e., also referred to as 5' and 3' untranslated regions). 

Given the coding strand sequences encoding TLCC-4 or TLCC-5 disclosed herein 
(e.g., SEQ ID NO:3 or 6), antisense nucleic acids of the invention can be designed 
according to the rules of Watson and Crick base pairing. The antisense nucleic acid 
molecule can be complementary to the entire coding region of TLCC-4 or TLCC-5 

25 mRNA, but more preferably is an oligonucleotide which is antisense to only a portion of 
the coding or noncoding region of TLCC-4 or TLCC-5 mRNA. For example, the 
antisense oligonucleotide can be complementary to the region surrounding the translation 
start site of TLCC-4 or TLCC-5 mRNA (e.g., between the -10 and +10 regions of the 
start site of a gene nucleotide sequence). An antisense oligonucleotide can be, for 

30 example, about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An antisense 
nucleic acid of the invention can be constructed using chemical synthesis and enzymatic 
ligation reactions using procedures known in the art. For example, an antisense nucleic 
acid (e.g., an antisense oligonucleotide) can be chemically synthesized using naturally 
occurring nucleotides or variously modified nucleotides designed to increase the 

35 biological stability of the molecules or to increase the physical stability of the duplex 
formed between the antisense and sense nucleic acids, e.g., phosphorothioate derivatives 
and acridine substituted nucleotides can be used. Examples of modified nucleotides 
which can be used to generate the antisense nucleic acid include 5-fluorouracil, 5- 
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bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xantine, 4-acetylcytosine, 5- 
(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5- 
carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, 
N6-isopentenyladenine, 1 -methylguanine, 1 -methy linosine, 2,2-dimethylguanine, 2- 
5 methy ladenine, 2 -methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7- 
methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta- 
D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio- 
N6-isopenteny ladenine, uracil-5-oxy acetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, 

10 uracil-5- oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thio\iracil, 3- 
(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w 5 and 2,6-diaminopurine. Alternatively, 
the antisense nucleic acid can be produced biologically using an expression vector into 
which a nucleic acid has been subcloned in an antisense orientation (i.e., RNA 
transcribed from the inserted nucleic acid will be of an antisense orientation to a target 

15 nucleic acid of interest, described further in the following subsection). 

The antisense nucleic acid molecules of the invention are typically administered 
to a subject or generated in situ such that they hybridize with or bind to cellular mRNA 
and/or genomic DNA encoding a TLCC-4 or TLGC-5 polypeptide to thereby inhibit 
expression of the polypeptide, e.g,, by inhibiting transcription and/or translation. The 

20 hybridization can be by conventional nucleotide complementarity to form a stable 

duplex, or, for example, in the case of an antisense nucleic acid molecule which binds to 
DNA duplexes, through specific interactions in the major groove of the double helix. 
An example of a route of administration of antisense nucleic acid molecules of the 
invention include direct injection at a tissue site. Alternatively, antisense nucleic acid 

25 molecules can be modified to target selected cells and then administered systemically. 
For example, for systemic administration, antisense molecules can be modified such that 
they specifically bind to receptors or antigens expressed on a selected cell surface, e.g., 
by linking the antisense nucleic acid molecules to peptides or antibodies which bind to 
cell surface receptors or antigens. The antisense nucleic acid molecules can also be 

30 delivered to cells using the vectors described herein. To achieve sufficient intracellular 
concentrations of the antisense molecules, vector constructs in which the antisense 
nucleic acid molecule is placed under the control of a strong pol II or pol III promoter 
are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention 
35 is an ot-anomeric nucleic acid molecule. An a-anomeric nucleic acid molecule forms 
specific double-stranded hybrids with complementary RNA in which, contrary to the 
usual P-units, the strands run parallel to each other (Gaultier et al (1987) Nucleic Acids. 
Res. 15:6625-6641). The antisense nucleic acid molecule can also comprise a 2 , -o- 
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methylribonucleotide (Inoue etal (1987) Nucleic Acids Res. 15:6131-6148) or a 
chimeric RNA-DNA analogue (Inoue etal (1987) FEES Lett. 215:327-330). 

In still another embodiment, an antisense nucleic acid of the invention is a 
ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity which are 
5 capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they 
have a complementary region. Thus, ribozymes {e.g., hammerhead ribozymes 
(described in Haselhoff and Gerlach (1988) Nature 334:585-591)) can be used to 
catalytically cleave TLCC-4 or TLCC-5 mRNA transcripts to thereby inhibit translation 
of TLCC-4 or TLCC-5 mRNA. A ribozyme having specificity for a TLCC-4-encoding 

10 or TLCC-5-encoding nucleic acid can be designed based upon the nucleotide sequence 
of a TLCC-4 or TLCC-5 cDNA disclosed herein (i.e., SEQ ID NO:l, 3, 4, or 6, or the 
nucleotide sequences of the DNA inserts of the plasmids deposited with ATCC as 

Accession Numbers ). For example, a derivative of a Tetrahymena L-19 IVS 

RNA can be constructed in which the nucleotide sequence of the active site is 

15 complementary to the nucleotide sequence to be cleaved in a TLCC-4-encoding or 

TLCC-5-encoding mRNA (see, e.g., Cech etal. U.S. Patent No. 4,987,071; and Cech et 
al U.S. Patent No. 5,1 16,742). Alternatively, TLCC-4 or TLCC-5 mRNA can be used 
to select a catalytic RNA having a specific ribonuclease activity from a pool of RNA 
molecules. See, e.g., Bartel, D. and Szostak, J.W. (1993) Science 261:1411-1418. 

20 Alternatively, TLCC-4 or TLCC-5 gene expression can be inhibited by targeting 

nucleotide sequences complementary to the regulatory region of the TLCC-4 or TLCC-5 
(e.g., the TLCC-4 or TLCC-5 promoter and/or enhancers) to form triple helical 
structures that prevent transcription of the TLCC-4 or TLCC-5 gene in target cells (see, 
generally, Helene, C. (1991) Anticancer Drug Des. 6(6):569-84; Helene, C. etal (1992) 

25 Ann. NY. Acad. Sci. 660:27-36; and Maher, L.J. (1992) Bioassays 14(12):807-15). 

In yet another embodiment, the TLCC-4 or TLCC-5 nucleic acid molecules of 
the present invention can be modified at the base moiety, sugar moiety or phosphate 
backbone to improve, e.g., the stability, hybridization, or solubility of the molecule. For 
example, the deoxyribose phosphate backbone of the nucleic acid molecules can be 

30 modified to generate peptide nucleic acids (see, e.g., Hyrup B. et al (1996) Bioorganic 
& Medicinal Chemistry 4 (1): 5-23). As used herein, the terms "peptide nucleic acids" 
1 or "PNAs" refer to nucleic acid mimics, e.g., DNA mimics, in which the deoxyribose 
phosphate backbone is replaced by a pseudopeptide backbone and only the four natural 
nucleobases are retained. The neutral backbone of PNAs has been shown to allow for 

35 specific hybridization to DNA and RNA under conditions of low ionic strength. The 
synthesis of PNA oligomers can be performed using standard solid phase peptide 
synthesis protocols as described in Hyrup B. et al. (1996) supra, Perry-O'Keefe et al. 
Proc. Natl Acad. Sci. 93: 14670-675. 
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PNAs of TLCC-4 or TLCC-5 nucleic acid molecules can be used in therapeutic 
and diagnostic applications. For example, PNAs can be used as antisense or antigene 
agents for sequence-specific modulation of gene expression by, for example, inducing 
transcription or translation arrest or inhibiting replication. PNAs of TLCC-4 or TLCC-5 
5 nucleic acid molecules can also be used in the analysis of single base pair mutations in a 
gene, (e.g., by PNA-directed PCR clamping); as 'artificial restriction enzymes' when 
used in combination with other enzymes, (e.g., SI nucleases (Hyrup B. (1996) supra)); 
or as probes or primers for DNA sequencing or hybridization (Hyrup B. et al. (1996) 
supra; Perry-O'Keefe supra). 

10 In another embodiment, PNAs of TLCC-4 or TLCC-5 can be modified, (e.g., to 

enhance their stability or cellular uptake), by attaching lipophilic or other helper groups 
to PNA, by the formation of PNA-DNA chimeras, or by the use .of liposomes or other 
techniques of drug delivery known in the art. For example, PNA-DNA chimeras of 
TLCC-4 or TLCC-5 nucleic acid molecules can be generated which may combine the 

15 advantageous properties of PNA and DNA. Such chimeras allow DNA recognition 
enzymes, (e.g., RNase H and DNA polymerases), to interact with the DNA portion 
while the PNA portion would provide high binding affinity and specificity. PNA-DNA 
chimeras can be linked using linkers of appropriate lengths selected in terms of base 
stacking, number of bonds between the nucleobases, and orientation (Hyrup B. (1996) : 

20 supra). The synthesis of PNA-DNA chimeras can be performed as described in Hyrup 
B. (1996) supra and Finn P J. et al (1996) Nucleic Acids Res. 24 (17): 3357-63. For 
example, a DNA chain can be synthesized on a solid support using standard 
phosphoramidite coupling chemistry and modified nucleoside analogs, e.g., 5'-(4- 
methoxytrityl)amino-5 f -deoxy-thymidine phosphoramidite, can be used as a between the 

25 PNA and the 5' end of DNA (Mag, M. et al (1989) Nucleic Acid Res. 17: 5973-88). 
PNA monomers are then coupled in a stepwise manner to produce a chimeric molecule 
with a 5 f PNA segment and a 3' DNA segment (Finn P. J. et al (1996) supra). 
Alternatively, chimeric molecules can be synthesized with a 5 ! DNA segment and a 3' 
PNA segment (Peterser, K.H. et al (1975) Bioorganic Med. Chem. Lett. 5:1119- 

30 11124). 

In other embodiments, the oligonucleotide may include other appended groups 
such as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating 
transport across the cell membrane (see, e.g., Letsinger et al. (1989) Proc. Natl Acad. 
Set USA 86:6553-6556; Lemaitre et al (1987) Proc. Natl. Acad. Sci. USA 84:648-652; 
35 PCT Publication No. W088/098 1 0) or the blood-brain barrier (see, e.g. , PCT Publication 
No. W089/10134). In addition, oligonucleotides can be modified with hybridization- 
triggered cleavage agents (see, e.g., Krol et al. (1988) Bio-Techniques 6:958-976) or 
. intercalating agents (see, e.g., Zon (1 988) Pharm. Res. 5:539-549). To this end, the 

26 
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oligonucleotide may be conjugated to another molecule, (e.g. , a peptide, hybridization 
triggered cross-linking agent, transport agent, or hybridization-triggered cleavage agent). 

Alternatively, the expression characteristics of an endogenous TLCC-4 or 
TLCC-5 gene within a cell line or microorganism may be modified by inserting a 
5 heterologous DNA regulatory element into the genome of a stable cell line or cloned 
microorganism such that the inserted regulatory element is operatively linked with the 
endogenous TLCC-4 or TLCC-5 gene. For example, an endogenous TLCC-4 or TLCC- 
5 gene which is normally "transcriptionally silent", i.e., a TLCC-4 or TLCC-5 gene 
which is normally not expressed, or is expressed only at very low levels in a cell line or 
10 microorganism, may be activated by inserting a regulatory element which is capable of 
promoting the expression of a normally expressed gene product in that cell line or 
microorganism. Alternatively, a transcriptionally silent, endogenous TLCC-4 or TLCC- 
5 gene may be activated by insertion of a promiscuous regulatory element that works 
across cell types. 

15 A heterologous regulatory element may be inserted into a stable cell line or 

cloned microorganism, such that it is operatively linked with an endogenous TLCC-4 or 
TLCC-5 gene, using techniques, such as targeted homologous recombination, which are 
well known to those of skill in the art, and described, e.g., in Chappel, U.S. Patent No. 
5,272,071; PCT publication No. WO 91/06667, published May 16, 1991. 

20 

II. Isolated TLCC-4 and TLCC-5 Polypeptides and Anti-TLCC-4 and Anti-TLCC-5 
Antibodies 

One aspect of the invention pertains to isolated or recombinant TLCC-4 and 
TLCC-5 polypeptides, and biologically active portions thereof, as well as polypeptide 

25 fragments suitable for use as immunogens to raise anti-TLCC-4 or anti-TLCC-5 

antibodies. In one embodiment, native TLCC-4 or TLCC-5 polypeptides can be isolated 
from cells or tissue sources by an appropriate purification scheme using standard 
polypeptide purification techniques. In another embodiment, TLCC-4 or TLCC-5 
polypeptides are produced by recombinant DNA techniques. Alternative to recombinant 

30 expression, a TLCC-4 or TLCC-5 polypeptide can be synthesized chemically using 
standard peptide synthesis techniques. 

An "isolated" or "purified" polypeptide or biologically active portion thereof is 
substantially free of cellular material or other contaminating proteins from the cell or 
tissue source from which the TLCC-4 or TLCC-5 polypeptide is derived, or 

35 substantially free from chemical precursors or other chemicals when chemically 

synthesized. The language "substantially free of cellular material" includes preparations 
of TLCC-4 or TLCC-5 polypeptide in which the polypeptide is separated from cellular 
components of the cells from which it is isolated or recombinantly produced. In one 

27 
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embodiment, the language "substantially free of cellular material" includes preparations 
of TLCC-4 or TLCC-5 polypeptide having less than about 30% (by dry weight) of non- 
TLCC-4 or non-TLCC-5 polypeptide (also referred to herein as a "contaminating 
protein"), more preferably less than about 20% of non-TLCC-4 or non-TLCC-5 
5 polypeptide, still more preferably less than about 10% of non-TLCC-4 or non-TLCC-5 
polypeptide, and most preferably less than about 5% non-TLCC-4 or non-TLCC-5 
polypeptide. When the TLCC-4 or TLCC-5 polypeptide or biologically active portion 
thereof is recombinantly produced, it is also preferably substantially free of culture 
medium, i.e., culture medium represents less than about 20%, more preferably less than 
10 about 10%, and most preferably less than about 5% of the volume of the polypeptide 
preparation. 

The language "substantially free of chemical precursors or other chemicals" 
includes preparations of TLCC-4 or TLCC-5 polypeptide in which the polypeptide is 
separated from chemical precursors or other chemicals which are involved in the 

15 synthesis of the polypeptide. In one embodiment, the language "substantially free of 
chemical precursors or other chemicals" includes preparations of TLCC-4 or TLCC-5 
polypeptide having less than about 30% (by dry weight) of chemical precursors or non- 
TLCC-4 or non-TLCC-5 chemicals, more preferably less than about 20% chemical 
precursors or non-TLCC-4 or non-TLCC-5 chemicals, still more preferably less than 

20 about 10% chemical precursors or non-TLCC-4 or non-TLCC-5 chemicals, and most 
preferably less than about 5% chemical precursors or non-TLCC-4 or non-TLCC-5 
chemicals. 

As used herein, a "biologically active portion" of a TLCC-4 or TLCC-5 
polypeptide includes a fragment of a TLCC-4 or TLCC-5 polypeptide which participates 

25 in an interaction between a TLCC-4 or TLCC-5 molecule and a non-TLCC-4 or non- 
TLCC-5 molecule. Biologically active portions of a TLCC-4 or TLCC-5 polypeptide 
include peptides comprising amino acid sequences sufficiently identical to or derived 
from the amino acid sequence of the TLCC-4 or TLCC-5 polypeptide, e.g., the amino 
acid sequence shown in SEQ ID NO:2 or 5, which include less amino acids than the full 

30 length TLCC-4 or TLCC-5 polypeptides, and exhibit at least one activity of a TLCC-4 
or TLCC-5 polypeptide. Typically, biologically active portions comprise a domain or 
motif with at least one activity of the TLCC-4 or TLCC-5 polypeptide, e.g., modulating 
membrane excitation mechanisms. A biologically active portion of a TLCC-4 or TLCC- 
5 polypeptide can be a polypeptide which is, for example, 25, 30, 35, 40, 45, 50, 75, 

35 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 
525, 550, 575, 600, 625, 650, 675, 700, 725, 750, 775, 800, 825, 850, 875, 900, 925, 
950, 975, 1000 or more amino acids in length. Biologically active portions of a TLCC-4 
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or TLCC-5 polypeptide can be used as targets for developing agents which modulate a 
TLCC-4 or TLCC-5 mediated activity, e.g., a membrane excitation mechanism. 

In one embodiment, a biologically active portion of a TLCC-4 or TLCC-5 
polypeptide comprises at least one transmembrane domain. It is to be understood that a 
5 preferred biologically active portion of a TLCC-4 or TLCC-5 polypeptide of the present 
invention comprises at least one or more of the following domains: an ankyrin repeat 
domain, a transmembrane domain, a pore domain, a transient receptor domain, and/or an 
ion transport protein domain. Moreover, other biologically active portions, in which 
other regions of the polypeptide are deleted, can be prepared by recombinant techniques 
10 and evaluated for one or more of the functional activities of a native TLCC-4 or TLCC-5 
. polypeptide. 

Another aspect of the invention features fragments of the protein having the 
amino acid sequence of SEQ ID NO:2 or 5, for example, for use as immunogens. In one 
embodiment, a fragment comprises at least 5 amino acids (e.g., contiguous or 

15 consecutive amino acids) of the amino acid sequence of SEQ ID NO:2 or 5, or amino 
acid sequences encoded by the DNA inserts of the plasmids deposited with the ATCC as 

Accession Numbers . In another embodiment, a fragment comprises at least 10, 

15, 20, 25, 30, 35, 40, 45, 50 or more amino acids (e.g., contiguous or consecutive 
amino acids) of the amino acid sequence of SEQ ID NO:2 or 5, or amino acid sequences 

20 encoded by the DNA inserts of the plasmids deposited with the ATCC as Accession 
Numbers . 

In a preferred embodiment, a TLCC-4 or TLCC-5 polypeptide has an amino acid 
sequence shown in SEQ ID NO:2 or 5. In other embodiments, the TLCC-4 or TLCC-5 
polypeptide is substantially identical to SEQ ID NO:2 or 5, and retains the functional 

25 activity of the polypeptide of SEQ ID NO:2 or 5, yet differs in amino acid sequence due 
to natural allelic variation or mutagenesis, as described in detail in subsection I above. 
In another embodiment, the TLCC-4 or TLCC-5 polypeptide is a polypeptide which 
comprises an amino acid sequence at least about 50%, 55%, 60%, 65%, 70%, 75%, 
80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or more identical to SEQ ID NO:2 or 5. 

30 In another embodiment, the invention features a TLCC-4 or TLCC-5 polypeptide 

which is encoded by a nucleic acid molecule consisting of a nucleotide sequence at least 
about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% 
or more identical to a nucleotide sequence of SEQ ID NO: 1 , 3, 4, or 6, or a complement 
thereof. This invention further features a TLCC-4 or TLCC-5 polypeptide which is 

35 encoded by a nucleic acid molecule consisting of a nucleotide sequence which 

hybridizes under stringent hybridization conditions to a complement of a nucleic acid 
molecule comprising the nucleotide sequence of SEQ ID NO:l, 3, 4, or 6, or a 
complement thereof 
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To determine the percent identity of two amino acid sequences or of two nucleic 
acid sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps 
can be introduced in one or both of a first and a second amino acid or nucleic acid 
sequence for optimal alignment and non-identical sequences can be disregarded for 
5 comparison purposes). In a preferred embodiment, the length of a reference sequence 
aligned for comparison purposes is at least 30%, preferably at least 40%, more 
preferably at least 50%, even more preferably at least 60%, and even more preferably at 
least 70%, 80%, or 90% of the length of the reference sequence (e.g., when aligning a 
second sequence to the TLCC-4 amino acid sequence of SEQ ID NO:2 having 742 

10 amino acid residues, at least 222, preferably at least 296, more preferably at least 371, 
more preferably at least 445, even more preferably at least 519, and even more 
preferably at least 593 or 667 or more amino acid residues are aligned; when aligning a 
second sequence to the TLCC-5 amino acid sequence of SEQ ID NO: 5 having 1013 
amino acid residues, at least 304, preferably at least 405, more preferably at least 506, 

15 more preferably at least 608, even more preferably at least 709, and even more 

preferably at least 810 or 912 or more amino acid residues are aligned). The amino acid 
residues or nucleotides at corresponding amino acid positions or nucleotide positions are 
then compared. When a position in the first sequence is occupied by the same amino 
acid residue or nucleotide as the corresponding position in the second sequence, then the 

20 molecules are identical at that position (as used herein amino acid or nucleic acid 

"identity" is equivalent to amino acid or nucleic acid "homology"). The percent identity 
between the two sequences is a function of the number of identical positions shared by 
the sequences, taking into account the number of gaps, and the length of each gap, 
which need to be introduced for optimal alignment of the two sequences. 

25 The comparison of sequences and determination of percent identity between two 

sequences can be accomplished using a mathematical algorithm. In a preferred 
embodiment, the percent identity between two amino acid sequences is determined 
using the Needleman and Wunsch (J. Mol Biol. (48):444-453 (1970)) algorithm which 
has been incorporated into the GAP program in the GCG software package (available at 

30 the GCG website), using either a Blosum 62 matrix or a PAM250 matrix, and a gap 
weight of 16, 14, 12, 10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6. In yet 
another preferred embodiment, the percent identity between two nucleotide sequences is 
determined using the GAP program in the GCG software package (available at the GCG 
website), using a NWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 70, or 80 and 

35 a length weight of 1 , 2, 3, 4, 5, or 6. A preferred, non-limiting example of parameters to 
be used in conjunction with the GAP program include a Blosum 62 scoring matrix with 
a gap penalty of 12, a gap extend penalty of 4, and a frameshift gap penalty of 5. 
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In another embodiment, the percent identity between two amino acid or 
nucleotide sequences is determined using the algorithm of E. Meyers and W. Miller 
(Comput. Appl Biosci., 4:1 1-17 (1988)) which has been incorporated into the ALIGN 
program (version 2.0 or version 2.0U), using a PAM120 weight residue table, a gap 
5 length penalty of 12 and a gap penalty of 4. 

The nucleic acid and polypeptide sequences of the present invention can further 
be used as a "query sequence" to perform a search against public databases to, for 
example, identify other family members or related sequences. Such searches can be 
performed using the NBLAST and XBLAST programs (version 2.0) of Altschul, et ah 

10 (1990) J. Mol Biol 215:403-10. BLAST nucleotide searches can be performed with the 
NBLAST program, score = 100, wordlength = 12 to obtain nucleotide sequences 
homologous to TLCC-4 or TLCC-5 nucleic acid molecules of the invention. BLAST 
protein searches can be performed with the XBLAST program, score = 100, wordlength 
= 3, and a Blosum62 matrix to obtain amino acid sequences homologous to TLCC-4 or 

15 TLCC-5 polypeptide molecules of the invention. To obtain gapped alignments for 
comparison purposes, Gapped BLAST can be utilized as described in Altschul et al 9 
(1 997) Nucleic Acids Res. 25(1 7):3389-3402. When utilizing BLAST and Gapped 
BLAST programs, the default parameters of the respective programs (e.g., XBLAST and 
NBLAST) can be used. See the NIH website. 

20 The invention also provides TLCC-4 or TLCC-5 chimeric or fusion proteins. As 

used herein, a TLCC-4 or TLCC-5 "chimeric protein" or "fusion protein" comprises a 
TLCC-4 or TLCC-5 polypeptide operatively linked to a non-TLCC-4 or non-TLCC-5 
polypeptide. A "TLCC-4 or TLCC-5 polypeptide" refers to a polypeptide having an 
amino acid sequence corresponding to TLCC-4 or TLCC-5, whereas a "non-TLCC-4 or 

25 non-TLCC-5 polypeptide" refers to a polypeptide having an amino acid sequence 
corresponding to a protein which is not substantially homologous to the TLCC-4 or 
TLCC-5 polypeptide, e.g., a protein which is different from the TLCC-4 or TLCC-5 
polypeptide and which is derived from the same or a different organism. Within a 
TLCC-4 or TLCC-5 fusion protein the TLCC-4 or TLCC-5 polypeptide can correspond 

30 to all or a portion of a TLCC-4 or TLCC-5 polypeptide. In a preferred embodiment, a 
TLCC-4 or TLCC-5 fusion protein comprises at least one biologically active portion of a 
TLCC-4 or TLCC-5 polypeptide. In another preferred embodiment, a TLCC-4 or 
TLCC-5 fusion protein comprises at least two biologically active portions of a TLCC-4 
or TLCC-5 polypeptide. Within the fusion protein, the term "operatively linked" is 

35 intended to indicate that the TLCC-4 or TLCC-5 polypeptide and the non-TLCC-4 or 
non-TLCC-5 polypeptide are fused in-frame to each other. The non-TLCC-4 or non- 
TLCC-5 polypeptide can be fused to the N-terminus or C-terminus of the TLCC-4 or 
TLCC-5 polypeptide. 
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For example, in one embodiment, the fusion protein is a GST-TLCC-4 or GST- 
TLCC-5 fusion protein in which the TLCC-4 or TLCC-5 sequences are fused to the C- 
terminus of the GST sequences. Such fusion proteins can facilitate the purification of 
recombinant TLCC-4 or TLCC-5. 
5 In another embodiment, the fusion protein is a TLCC-4 or TLCC-5 polypeptide 

containing a heterologous signal sequence at its N-terminus. In certain host cells (e.g., 
mammalian host cells), expression and/or secretion of TLCC-4 or TLCC-5 can be 
increased through the use of a heterologous signal sequence. 

The TLCC-4 or TLCC-5 fusion proteins of the invention can be incorporated 

10 into pharmaceutical compositions and administered to a subject in vivo. The TLCC-4 or 
TLCC-5 fusion proteins can be used to affect the bioavailability of a TLCC-4 or TLCC- 
5 substrate. Use of TLCC-4 or TLCC-5 fusion proteins may be useful therapeutically 
for the treatment of disorders caused by, for example, (i) aberrant modification or 
mutation of a gene encoding a TLCC-4 or TLCC-5 polypeptide; (ii) mis-regulation of 

15 the TLCC-4 or TLCC-5 gene; and (hi) aberrant post-translational modification of a 
TLCC-4 or TLCC-5 polypeptide. 

Moreover, the TLCC-4-fusion proteins or TLCC-5-fusion proteins of the 
invention can be used as immunogens to produce anti-TLCC-4 or anti-TLCC-5 
antibodies in a subject, to purify TLCC-4 or TLCC-5 ligands and in screening assays to 

20 identify molecules which inhibit the interaction of TLCC-4 or TLCC-5 with a TLCC-4 
or TLCC-5 substrate. 

Preferably, a TLCC-4 or TLCC-5 chimeric or fusion protein of the invention is 
produced by standard recombinant DNA techniques. For example, DNA fragments 
coding for the different polypeptide sequences are ligated together in-frame in 

25 accordance with conventional techniques, for example by employing blunt-ended or 
stagger-ended termini for ligation, restriction enzyme digestion to provide for 
appropriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase 
treatment to avoid undesirable joining, and enzymatic ligation. In another embodiment, 
the fusion gene can be synthesized by conventional techniques including automated 

30 DNA synthesizers. Alternatively, PCR amplification of gene fragments can be carried 
out using anchor primers which give rise to complementary overhangs between two 
consecutive gene fragments which can subsequently be annealed and reamplified to 
generate a chimeric gene sequence (see, for example, Current Protocols in Molecular 
Biology, eds. Ausubel et al John Wiley & Sons: 1992). Moreover, many expression 

35 vectors are commercially available that already encode a fusion moiety (e.g., a GST 

polypeptide). A TLCC-4-encoding or TLCC-5-encoding nucleic acid can be cloned into 
such an expression vector such that the fusion moiety is linked in-frame to the TLCC-4 
or TLCC-5 polypeptide. 
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The present invention also pertains to variants of the TLCC-4 or TLCC-5 
polypeptides which function as either TLCC-4 or TLCC-5 agonists (mimetics) or as 
TLCC-4 or TLCC-5 antagonists. Variants of the TLCC-4 or TLCC-5 polypeptides can 
be generated by mutagenesis, e.g., discrete point mutation or truncation of a TLCC-4 or 
5 TLCC-5 polypeptide. An agonist of the TLCC-4 or TLCC-5 polypeptides can retain 
substantially the same, or a subset, of the biological activities of the naturally occurring 
form of a TLCC-4 or TLCC-5 polypeptide. An antagonist of a TLCC-4 or TLCC-5 
polypeptide can inhibit one or more of the activities of the naturally occurring form of 
the TLCC-4 or TLCC-5 polypeptide by, for example, competitively modulating a 

1 0 TLCC-4-mediated activity or TLCC-5-mediated activity of a TLCC-4 or TLCC-5 

polypeptide. Thus, specific biological effects can be elicited by treatment with a variant 
of limited function. In one embodiment, treatment of a subject with a variant having a 
subset of the biological activities of the naturally occurring form of the polypeptide has 
fewer side effects in a subject relative to treatment with the naturally occurring form of 

1 5 the TLCC-4 or TLCC-5 polypeptide. 

In one embodiment, variants of a TLCC-4 or TLCC-5 polypeptide which 
function as either TLCC-4 or TLCC-5 agonists (mimetics) or as TLCC-4 or TLCC-5 
antagonists can be identified by screening combinatorial libraries of mutants, e.g., 
truncation mutants, of a TLCC-4 or TLCC-5 polypeptide for TLCC-4 or TLCC-5 

20 polypeptide agonist or antagonist activity. In one embodiment, a variegated library of 
TLCC-4 or TLCC-5 variants is generated by combinatorial mutagenesis at the nucleic 
acid level and is encoded by a variegated gene library. A variegated library of TLCC-4 
or TLCC-5 variants can be produced by, for example, enzymatically ligating a mixture 
of sy nthetic oligonucleotides into gene sequences such that a degenerate set of potential 

25 TLCC-4 or TLCC-5 sequences is expressible as individual polypeptides, or 

alternatively, as a set of larger fusion proteins (e.g.,. for phage display) containing the set 
of TLCC-4 or TLCC-5 sequences therein. There are a variety of methods which can be 
used to produce libraries of potential TLCC-4 or TLCC-5 variants from a degenerate 
oligonucleotide sequence. Chemical synthesis of a degenerate gene sequence can be 

30 performed in an automatic DNA synthesizer, and the synthetic gene then ligated into an 
appropriate expression vector. Use of a degenerate set of genes allows for the provision, 
in one mixture, of all of the sequences encoding the desired set of potential TLCC-4 or 
TLCC-5 sequences. Methods for synthesizing degenerate oligonucleotides are known in 
the art (see, e.g., Narang, S.A. (1983) Tefrahedron 39:3; Itakura et al (1984) Annu. Rev. 

35 Biochem. 53:323; Itakura et al. (1984) Science 198:1056; Ike et al. (1983) Nucleic Acid 
Res. 11:477. 

In addition, libraries of fragments of a TLCC-4 or TLCC-5 polypeptide coding 
sequence can be used to generate a variegated population of TLCC-4 or TLCC-5 - 
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fragments for screening and subsequent selection of variants of a TLCC-4 or TLCC-5 
polypeptide. In one embodiment, a library of coding sequence fragments can be 
generated by treating a double stranded PCR fragment of a TLCC-4 or TLCC-5 coding 
sequence with a nuclease under conditions wherein nicking occurs only about once per 
5 molecule, denaturing the double stranded DNA, renaturing the DNA to form double 
stranded DNA which can include sense/antisense pairs from different nicked products, 
removing single stranded portions from reformed duplexes by treatment with SI 
nuclease, and ligating the resulting fragment library into an expression vector. By this 
method, an expression library can be derived which encodes N-terminal, C-terminal and 

10 internal fragments of various sizes of the TLCC-4 or TLCC-5 polypeptide. 

Several techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations or truncation, and for screening cDNA 
libraries for' gene products having a selected property. Such techniques are adaptable for 
rapid screening of the gene libraries generated by the combinatorial mutagenesis of 

15 TLCC-4 or TLCC-5 polypeptides. The most widely used techniques, which are 
amenable to high through-put analysis, for screening large gene libraries typically 
include cloning the gene library into replicable expression vectors, transforming 
appropriate cells with the resulting library of vectors, and expressing the combinatorial 
genes under conditions in which detection of a desired activity facilitates isolation of the 

20 vector encoding the gene whose product was detected. Recursive ensemble mutagenesis 
(REM), a new technique which enhances the frequency of functional mutants in the 
libraries, can be used in combination with the screening assays to identify TLCC-4 or 
TLCC-5 variants (Arkin and Yourvan (1992) Proc. Natl. Acad. Sci. USA 59:7811-7815; 
Delgrave et ah (1993) Protein Engineering 6(3):327-331). 

25 In one embodiment, cell based assays can be exploited to analyze a variegated 

TLCC-4 or TLCC-5 library. For example, a library of expression vectors can be 
transfected into a cell line, e.g., an endothelial cell line, which ordinarily responds to 
TLCC-4 or TLCC-5 in a particular TLCC-4 or TLCC-5 substrate-dependent manner. 
The transfected cells are then contacted with TLCC-4 or TLCC-5 and the effect of 

30 expression of the mutant on signaling by the TLCC-4 or TLCC-5 substrate can be 

detected, e.g., by monitoring intracellular calcium, IP3, or diacylglycerol concentration, 
phosphorylation profile of intracellular proteins, or the activity of a TLCC-4-regulated 
or TLCC-5-regulated transcription factor. Plasmid DNA can then be recovered from the 
cells which score for inhibition, or alternatively, potentiation of signaling by the TLCC- 

35 4 or TLCC-5 substrate, and the individual clones further characterized. 

An isolated TLCC-4 or TLCC-5 polypeptide, or a portion or fragment thereof, 
can be used as an immunogen to generate antibodies that bind TLCC-4 or TLCC-5 using 
standard techniques for polyclonal and monoclonal antibody preparation. A full-length 
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TLCC-4 or TLCC-5 polypeptide can be used or, alternatively, the invention provides 
antigenic peptide fragments of TLCC-4 or TLCC-5 for use as immunogens. The 
antigenic peptide of TLCC-4 or TLCC-5 comprises at least 8 amino acid residues of the 
amino acid sequence shown in SEQ ID NO:2 or 5 and encompasses an epitope of 
5 TLCC-4 or TLCC-5 such that an antibody raised against the peptide forms a specific 
immune complex with TLCC-4 or TLCC-5. Preferably, the antigenic peptide comprises 
at least 10 amino acid residues, more preferably at least 15 amino acid residues, even 
more preferably at least 20 amino acid residues, and most preferably at least 30 amino 
acid residues. 

10 Preferred epitopes encompassed by the antigenic peptide are regions of TLCC-4 

or TLCC-5 that are located on the surface of the polypeptide, e.g., hydrophilic regions, 
as well as regions with high antigenicity (see, for example, Figures 4 and 7). 

A TLCC-4 or TLCC-5 immunogen typically is used to prepare antibodies by 
immunizing a suitable subject, (e.g., rabbit, goat, mouse or other mammal) with the 

15 immunogen. An appropriate immunogenic preparation can contain, for example, 

recombinantly expressed TLCC-4 or TLCC-5 polypeptide or a chemically synthesized 
TLCC-4 or TLCC-5 polypeptide. The preparation can further include an adjuvant, such 
as Freund's complete or incomplete adjuvant, or similar immunostimulatory agent. 
Immunization of a suitable subject with an immunogenic TLCC-4 or TLCC-5 

20 preparation induces a polyclonal anti-TLCC-4 or anti-TLCC-5 antibody response. 

Accordingly, another aspect of the invention pertains to anti-TLCC-4 or anti- 
TLCC-5 antibodies. The term "antibody" as used herein refers to immunoglobulin 
molecules and immunologically active portions of immunoglobulin molecules, i.e., 
molecules that contain an antigen binding site which specifically binds (immunoreacts 

25 with) an antigen, such as TLCC-4 or TLCC-5. Examples of immunologically active 
portions of immunoglobulin molecules include F(ab) and F(ab')2 fragments which can 

be generated by treating the antibody with an enzyme such as pepsin. The invention 
provides polyclonal and monoclonal antibodies that bind TLCC-4 or TLCC-5. The term 
"monoclonal antibody" or "monoclonal antibody composition", as used herein, refers to 

30 a population of antibody molecules that contain only one species of an antigen binding 
site capable of immunoreacting with a particular epitope of TLCC-4 or TLCC-5. A 
monoclonal antibody composition thus typically displays a single binding affinity for a 
particular TLCC-4 or TLCC-5 polypeptide with which it immunoreacts. 

Polyclonal anti-TLCC-4 or anti-TLCC-5 antibodies can be prepared as described 

35 above by immunizing a suitable subject with a TLCC-4 or TLCC-5 immunogen. The 
anti-TLCC-4 or anti-TLCC-5 antibody titer in the immunized subject can be monitored 
over time by standard techniques, such as with an enzyme linked immunosorbent assay 
(ELISA) using immobilized TLCC-4 or TLCC-5. If desired, the antibody molecules 
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directed against TLCC-4 or TLCC-5 can be isolated from the mammal (e.g., from the 
blood) and further purified by well known techniques, such as protein A 
chromatography to obtain the IgG fraction. At an appropriate time after immunization, 
e.g., when the anti-TLCC-4 or anti-TLCC-5 antibody titers are highest, antibody- 
5 producing cells can be obtained from the subject and used to prepare monoclonal 

antibodies by standard techniques, such as the hybridoma technique originally described 
by Kohler and Milstein (1975) Nature 256:495-497) (see also, Brown et al (1981) J. 
Immunol. 127:539-46; Brown et al (1980) J. Biol Chem .255:4980-83; Yeh et al 
(1976) Proc. Natl Acad. Set. USA 76:2927-31; and Yeh etal (1982) Int. J. Cancer 

10 29:269-75), the more recent human B cell hybridoma technique (Kozbor et al (1983) 
Immunol Today 4:72), the EBV-hybridoma technique (Cole et al (1985), Monoclonal 
Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96) or trioma techniques. 
The technology for producing monoclonal antibody hybridomas is well known (see, 
generally, R. H. Kenneth, in Monoclonal Antibodies: A New Dimension In Biological 

15 Analyses, Plenum Publishing Corp., New York, New York (1980); E. A. Lerner (1981) 
Yale J. Biol Med. f 54:387-402; M. L. Gefter et al (1977) Somatic Cell Genet. 
3:231-36). Briefly, an immortal cell line (typically a myeloma) is fused to lymphocytes 
(typically splenocytes) from a mammal immunized with a TLCC-4 or TLCC-5 
immunogen as described above, and the culture supernatants of the resulting hybridoma 

20 cells are screened to identify a hybridoma producing a monoclonal antibody that binds 
TLCC-4 or TLCC-5. 

Any of the many well known protocols used for fusing lymphocytes and 
immortalized cell lines can be applied for the purpose of generating an anti-TLCC-4 or 
anti-TLCC-5 monoclonal antibody (see, e.g., G. Galfre et al. (1977) Nature 266:55052; 

25 Gefter et al Somatic Cell Genet. , cited supra; Lerner, Yale J. Biol Med, cited supra; 
Kenneth, Monoclonal Antibodies, cited supra). Moreover, the ordinarily skilled worker 
will appreciate that there are many variations of such methods which also would be 
useful. Typically, the immortal cell line (e.g., a myeloma cell line) is derived from the 
same mammalian species as the lymphocytes. For example, murine hybridomas can be 

30 made by fusing lymphocytes from a mouse immunized with an immunogenic 

preparation of the present invention with an immortalized mouse cell line. Preferred 
immortal cell lines are mouse myeloma cell lines that are sensitive to culture medium 
containing hypoxanthine, aminopterin and thymidine ("HAT medium"). Any of a 
number of myeloma cell lines can be used as a fusion partner according to standard 

35 techniques, e.g., the P3-NSl/l-Ag4-l, P3-x63-Ag8.653 or Sp2/0-Agl4 myeloma lines. 
These myeloma lines are available from ATCC. Typically, HAT-sensitive mouse 
myeloma cells are fused to mouse splenocytes using polyethylene glycol ("PEG"). 
Hybridoma cells resulting from the fusion are then selected using HAT medium, which 
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kills unfused and unproductively fused myeloma cells (unfiised splenocytes die after 
several days because they are not transformed). Hybridoma cells producing a 
monoclonal antibody of the invention are detected by screening the hybridoma culture 
supernatants for antibodies that bind TLCC-4 or TLCC-5, e.g., using a standard ELISA 
5 assay. 

Alternative to preparing monoclonal antibody-secreting hybridomas, a 
monoclonal anti-TLCC-4 or anti-TLCC-5 antibody can be identified and isolated by 
screening a recombinant combinatorial immunoglobulin library (e.g., an antibody phage 
display library) with TLCC-4 or TLCC-5 to thereby isolate immunoglobulin library 

10 members that bind TLCC-4 or TLCC-5. Kits for generating and screening phage 

display libraries are commercially available (e.g., the Pharmacia Recombinant Phage 
Antibody System, Catalog No. 27-9400-01 ; and the Stratagene SurfZAP™ Phage 
Display Kit, Catalog No. 240612). Additionally, examples of methods and reagents 
particularly amenable for use in generating and screening antibody display library can 

15 be found in, for example, Ladner et al. U.S. Patent No. 5,223,409; Kang et al. PCT 

International Publication No. WO 92/18619; Dower et al PCT International Publication 
No. WO 91/17271; Winter et al. PCT International Publication WO 92/20791; Markland 
et al PCT International Publication No. WO 92/1 5679; Breitling et al. PCT 
International Publication WO 93/01288; McCafferty et al. PCT International Publication 

20 No. WO 92/01047; Garrard et al. PCT International Publication No. WO 92/09690; 
Ladner a/. PCT International Publication No. WO 90/02809; Fuchs etal (1991) 
Bio/Technology 9:1370-1372; Hay etal (1992) Hum. Antibod. Hybridomas 3:81-85; 
Huseetal (1989) Science 246:1275-1281; Griffiths etal (1993) EMBO J 12:725-734; 
Hawkins e/tf/. (1992) J. Mol Biol 226:889-896; Clarksone/tf/. (\99X)Nature 352:624- 

25 628; Gram et al (1992) Proc. Natl Acad. Sci. USA 89:3576-3580; Garrad et al. (1991) 
Bio/Technology 9:1373-1377; Hoogenboom etal (1991) Nuc. Acid Res. 19:4133-4137; 
Barbas et al (1991) Proc. Natl. Acad. Sci. USA 88:7978-7982; and McCafferty et al. 
Nature (1990) 348:552-554. 

Additionally, recombinant anti-TLCC-4 or anti-TLCC-5 antibodies, such as 

30 chimeric and humanized monoclonal antibodies, comprising both human and non- 
human portions, which can be made using standard recombinant DNA techniques, are 
within the scope of the invention. Such chimeric and humanized monoclonal antibodies 
can be produced by recombinant DNA techniques known in the art, for example using 
methods described in Robinson et al International Application No. PCT/US86/02269; 

35 Akira, et al European Patent Application 184,187; Taniguchi, M., European Patent 
Application 171,496; Morrison et al European Patent Application 173,494; Neuberger 
etal PCT International Publication No. WO 86/01533; Cabilly etal U.S. Patent No. 
4,816,567; Cabilly et al. European Patent Application 125,023; Better et al (1988) 
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Science 240:1041-1043; Liu et al. (1987) Proc. Natl Acad. Set USA 84:3439-3443; Liu 
et al (1987) J. Immunol 139:3521-3526; Sun et al (1987) Proa Natl Acad Sci. USA 
84:214-218; Nishimura al (1987) Cane. Res. 47:999-1005; Woodetal (1985) 
Nature 314:446-449; and Shaw et al (1988) J. Natl Cancer Inst. 80:1553-1559); 
5 Morrison, S. L. (1985) Science 229: 1202-1207; Oi et al (1986) BioTechniques 4:214; 
Winter U.S. Patent 5,225,539; Jones et al. (1986) Nature 321 :552-525; Verhoeyan et al 
( 1 988) Science 239: 1534; and Beidler^ al (1988) J. Immunol 141:4053-4060. 

An anti-TLCC-4 or anti-TLCC-5 antibody (e.g., monoclonal antibody) can be 
used to isolate TLCC-4 or TLCC-5 by standard techniques, such as affinity 

10 chromatography or immunoprecipitation. An anti-TLCC-4 or anti-TLCC-5 antibody 
can facilitate the purification of natural TLCC-4 or TLCC-5 from cells and of 
recombinantly produced TLCC-4 or TLCC-5 expressed in host cells. Moreover, an anti- 
TLCC-4 or anti-TLCC-5 antibody can be used to detect TLCC-4 or TLCC-5 
polypeptide (e.g., in a cellular lysate or cell supernatant) in order to evaluate the 

15 abundance and pattern of expression of the TLCC-4 or TLCC-5 polypeptide. Anti- 
TLCC-4 or anti-TLCC-5 antibodies can be used diagnostically to monitor polypeptide 
levels in tissue as part of a clinical testing procedure, e.g., to, for example, determine the 
efficacy of a given treatment regimen. Detection can be facilitated by coupling (Le., 
physically linking) the antibody to a detectable substance. Examples of detectable 

20 substances include various enzymes, prosthetic groups, fluorescent materials, 

luminescent materials, bioluminescent materials, and radioactive materials. Examples 
of suitable enzymes include horseradish peroxidase, alkaline phosphatase, (3- 
galactosidase, or acetylcholinesterase; examples of suitable prosthetic group complexes 
include streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials 

25 include umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, 

dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example of a 

luminescent material includes luminol; examples of bioluminescent materials include 

luciferase, luciferin, and aequorin, and examples of suitable radioactive material include 
125j 5 131 I? 35 Sor 3 H 

30 

HI. Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, for example recombinant 
expression vectors, containing a TLCC-4 or TLCC-5 nucleic acid molecule or vectors 
containing a nucleic acid molecule which encodes a TLCC-4 or TLCC-5 polypeptide (or 
35 a portion thereof). As used herein, the term "vector" refers to a nucleic acid molecule 
capable of transporting another nucleic acid to which it has been linked. One type of 
vector is a "plasmid", which refers to a circular double stranded DNA loop into which 
additional DNA segments can be ligated. Another type of vector is a viral vector, 
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wherein additional DNA segments can be ligated into the viral genome. Certain vectors 
are capable of autonomous replication in a host cell into which they are introduced (e.g., 
bacterial vectors having a bacterial origin of replication and episomal mammalian 
vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the 
5 genome of a host cell upon introduction into the host cell, and thereby are replicated 
along with the host genome. Moreover, certain vectors are capable of directing the 
expression of genes to which they are operatively linked. Such vectors are referred to 
herein as "expression vectors". In general, expression vectors of utility in recombinant 
DNA techniques are often in the form of plasmids. In the present specification, 
10 "plasmid" and "vector" can be used interchangeably as the plasmid is the most 

commonly used form of vector. However, the invention is intended to include such 
other forms of expression vectors, such as viral vectors (e.g., replication defective 
retroviruses, adenoviruses and adeno-associated viruses), which serve equivalent 
functions. 

15 The recombinant expression vectors of the invention comprise a nucleic acid of 

the invention in a form suitable for expression of the nucleic acid in a host cell, which 
means that the recombinant expression vectors include one or more regulatory 
sequences, selected on the basis of the host cells to be used for expression, which is 
operatively linked to the nucleic acid sequence to be expressed. Within a recombinant 

20 expression vector, "operably linked" is intended to mean that the nucleotide sequence of 
interest is linked to the regulatory sequence(s) in a manner which allows for expression 
of the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a 
host cell when the vector is introduced into the host cell). The term "regulatory 
sequence" is intended to include promoters, enhancers and other expression control 

25 elements (e.g., polyadenylation signals). Such regulatory sequences are described, for 
example, in Goeddel; Gene Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, CA (1990). Regulatory sequences include those which 
direct constitutive expression of a nucleotide sequence in many types of host cells and 
those which direct expression of the nucleotide sequence only in certain host cells (e.g., 

30 tissue-specific regulatory sequences). It will be appreciated by those skilled in the art 
that the design of the expression vector can depend on such factors as the choice of the 
host cell to be transformed, the level of expression of protein desired, and the like. The 
expression vectors of the invention can be introduced into host cells to thereby produce 
proteins or peptides, including fusion proteins or peptides, encoded by nucleic acids as 

35 described herein (e.g., TLCC-4 or TLCC-5 polypeptides, mutant forms of TLCC-4 or 
TLCC-5 polypeptides, fusion proteins, and the like). 
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Accordingly, an exemplary embodiment provides a method for producing a 
protein, preferably a TLCC-4 or TLCC-5 polypeptide, by culturing in a suitable medium 
a host cell of the invention (e.g., a mammalian host cell such as a non-human 
mammalian cell) containing a recombinant expression vector, such that the protein is 
5 produced. 

The recombinant expression vectors of the invention can be designed for 
expression of TLCC-4 or TLCC-5 polypeptides in prokaryotic or eukaryotic cells. For 
example, TLCC-4 or TLCC-5 polypeptides can be expressed in bacterial cells such as E. 
coli, insect cells (using baculovirus expression vectors) yeast cells or mammalian cells. 

10 Suitable host cells are discussed further in Goeddel, Gene Expression Technology: 

Methods in Enzymology 185, Academic Press, San Diego, CA (1990). Alternatively, the 
recombinant expression vector can be transcribed and translated in vitro, for example 
using T7 promoter regulatory sequences and T7 polymerase. 

Expression of proteins in prokaryotes is most often carried out in E. coli with 

15 vectors containing constitutive or inducible promoters directing the expression of either 
fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the recombinant protein. Such fusion 
vectors typically serve three purposes: 1) to increase expression of recombinant protein; 
2) to increase the solubility of the recombinant protein; and 3) to aid in the purification 

20 of the recombinant protein by acting as a ligand in affinity purification. Often, in fusion 
expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion 
moiety and the recombinant protein to enable separation of the recombinant protein 
from the fusion moiety subsequent to purification of the fusion protein. Such enzymes, 
and their cognate recognition sequences, include Factor Xa, thrombin and enterokinase. 

25 Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith, D.B. 
and Johnson, K.S. (1988) Gene 67:31-40), pMAL (New England Biolabs, Beverly, MA) 
and pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (GST), 
maltose E binding protein, or protein A, respectively, to the target recombinant protein. 
Purified fusion proteins can be utilized in TLCC-4 or TLCC-5 activity assays, 

30 (e.g., direct assays or competitive assays described in detail below), or to generate 
antibodies specific for TLCC-4 or TLCC-5 polypeptides, for example. In a preferred 
embodiment, a TLCC-4 or TLCC-5 fusion protein expressed in a retroviral expression 
vector of the present invention can be utilized to infect bone marrow cells which are 
subsequently transplanted into irradiated recipients. The pathology of the subject 

35 recipient is then examined after sufficient time has passed (e.g. , six (6) weeks). 

Examples of suitable inducible non-fusion E. coli expression vectors include 
pTrc (Amann et al, (1988) Gene 69:301-315) and pET 1 Id (Studier et al, Gene 
Expression Technology: Methods in Enzymology 1.85, Academic Press, San Diego, 
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California (1990) 60-89). Target gene expression from the pTrc vector relies on host 
RNA polymerase transcription from a hybrid trp-lac fusion promoter. Target gene 
expression from the pET lid vector relies on transcription from a T7 gnlO-lac fusion 
promoter mediated by a coexpressed viral RNA polymerase (T7 gnl). This viral 
5 polymerase is supplied by host strains BL21(DE3) or HMS174(DE3) from a resident 
prophage harboring a T7 gnl gene under the transcriptional control of the lacUV 5 
promoter. 

One strategy to maximize recombinant protein expression in E. coli is to express 
the protein in a host bacteria with an impaired capacity to proteolytically cleave the 

10 recombinant protein (Gbttesman, S., Gene Expression Technology: Methods in 

Enzymology 185, Academic Press, San Diego, California (1990) 1 19-128). Another 
strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted into an 
expression vector so that the individual codons for each amino acid are those 
preferentially utilized in E. coli (Wada et al 9 (1992) Nucleic Acids Res. 20:21 11-211 8). 

15 Such alteration of nucleic acid sequences of the invention can be carried out by standard 
DNA synthesis techniques. 

In another embodiment, the TLCC-4 or TLCC-5 expression vector is a yeast 
expression vector. Examples of vectors for expression in yeast S. cerevisiae include 
pYepSecl (Baldari, et al, (1987) EmboJ. 6:229-234), pMFa (Kurjan and Herskowitz, 

20 (1982) Cell 30:933-943), pJRY88 (Schultz etal., (1987) Gene 54:1 13-123), pYES2 
(Invitrogen Corporation, San Diego, CA), and picZ (InVitrogen Corp, San Diego, CA). 

Alternatively, TLCC-4 or TLCC-5 polypeptides can be expressed in insect cells 
using baculovirus expression vectors. Baculovirus vectors available for expression of 
proteins in cultured insect cells (e.g., Sf 9 cells) include the pAc series (Smith et al 

25 (1983) Mol Cell Biol. 3:2156-2165) and the pVL series (Lucklow and Summers (1989) 
Virology 170:31-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in 
mammalian cells using a mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed, B. (1987) Nature 329:840) and pMT2PC 

30 (Kaufinan et al (1 987) EMBO J. 6: 1 87-1 95). When used in mammalian cells, the 
expression vector's control functions are often provided by viral regulatory elements. 
For example, commonly used promoters are derived from polyoma, Adenovirus 2, 
cytomegalovirus and Simian Virus 40. For other suitable expression systems for both 
prokaryotic and eukaryotic cells see chapters 16 and 17 of Sambrook, J., Fritsh, E. F., 

35 and Maniatis, T. Molecular Cloning: A Laboratory Manual. 2nd, ed. } Cold Spring 
Harbor Laboratory; Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
1989. 
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In another embodiment, the recombinant mammalian expression vector is 
capable of directing expression of the nucleic acid preferentially in a particular cell type 
(e.g., tissue-specific regulatory elements are used to express the nucleic acid). Tissue- 
specific regulatory elements are known in the art. Non-limiting examples of suitable 
5 tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al 
(1987) Genes Dev. 1:268-277), lymphoid-specific promoters (Calame and Eaton (1988) 
Adv. Immunol. 43:235-275), in particular promoters of T cell receptors (Winoto and 
Baltimore (1989) EMBOJ. 8:729-733) and immunoglobulins (Banerji et al (1983) Cell 
33:729-740; Queen and Baltimore (1983) Cell 33:741-748), neuron-specific promoters 

10 (e.g., the neurofilament promoter; Byrne and Ruddle (1989) Proc. Natl. Acad. Set USA 
86:5473-5477), pancreas-specific promoters (Edlund et al (1985) Science 230:912-916), 
and mammary gland-specific promoters (e.g., milk whey promoter; U.S. Patent No. 
4,873,316 and European Application Publication No. 264,166). Developmentally- 
regulated promoters are also encompassed, for example the murine hox promoters 

15 (Kessel and Gruss (1990) Science 249:374-379) and the a-fetoprotein promoter 
(Campes and Tilghman (1989) Genes Dev. 3:537-546). 

The invention further provides a recombinant expression vector comprising a 
DNA molecule of the invention cloned into the expression vector in an antisense 
orientation. That is, the DNA molecule is operatively linked to a regulatory sequence in 

20 a manner which allows for expression (by transcription of the DNA molecule) of an 

RNA molecule which is antisense to TLCC-4 or TLCC-5 rnRNA. Regulatory sequences 
operatively linked to a nucleic acid cloned in the antisense orientation can be chosen 
which direct the continuous expression of the antisense RNA molecule in a variety of 
cell types, for instance viral promoters and/or enhancers, or regulatory sequences can be 

25 chosen which direct constitutive, tissue specific or cell type specific expression of 
antisense RNA. The antisense expression vector can be in the form of a recombinant 
plasmid, phagemid or attenuated virus in which antisense nucleic acids are produced 
under the control of a high efficiency regulatory region, the activity of which can be 
determined by the cell type into which the vector is introduced. For a discussion of the 

30 regulation of gene expression using antisense genes see Weintraub, H. et al, Antisense 
RNA as a molecular tool for genetic analysis, Reviews - Trends in Genetics, Vol. 1(1) 
1986. 

Another aspect of the invention pertains to host cells into which a TLCC-4 or 
TLCC-5 nucleic acid molecule of the invention is introduced, e.g., a TLCC-4 or TLCC- 
35 5 nucleic acid molecule within a vector (e.g. , a recombinant expression vector) or a 
TLCC-4 or TLCC-5 nucleic acid molecule containing sequences which allow it to 
homologously recombine into a specific site of the host cell's genome. The terms "host 
cell" and "recombinant host cell" are used interchangeably herein. It is understood that 
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such terms refer not only to the particular subject cell but to the progeny or potential 
. progeny of such a cell. Because certain modifications may occur in succeeding 

generations due to either mutation or environmental influences, such progeny may not, 
in fact, be identical to the parent cell, but are still included within the scope of the term 
5 as used herein. 

A host cell can be any prokaryotic or eukaryotic cell. For example, a TLCC-4 or 
TLCC-5 polypeptide can be expressed in bacterial cells such as E. coli, insect cells, 
yeast or mammalian cells (such as Chinese hamster ovary cells (CHO) or COS cells). 
Other suitable host cells are known to those skilled in the art. 

10 Vector DNA can be introduced into prokaryotic or eukaryotic cells via 

conventional transformation or transfection techniques. As used herein, the terms 
"transformation" and "transfection" are intended to refer to a variety of art-recognized 
techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, including 
calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated 

15 transfection, lipofection, or electroporation. Suitable methods for transforming or 
transfecting host cells can be found in Sambrook, et al. (Molecular Cloning: A 
Laboratory Manual. 2nd, ed. y Cold Spring Harbor Labor atoiy, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY, 1989), and other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon the 

20 expression vector and transfection technique used, only a small fraction of ceils may 
integrate the foreign DNA into their genome. In order to identify and select these 
integrants, a gene that encodes a selectable marker (e.g., resistance to antibiotics) is 
generally introduced into the host cells along with the gene of interest. Preferred 
selectable markers include those which confer resistance to drugs, such as G418, 

25 hygromycin and methotrexate. Nucleic acid encoding a selectable marker can be 
introduced into a host cell on the same vector as that encoding a TLCC-4 or TLCC-5 
polypeptide, or a nucleic acid encoding a selectable marker can be introduced on a 
separate vector. Cells stably transfected with the introduced nucleic acid can be 
identified by drug selection (e.g., cells that have incorporated the selectable marker gene 

30 will survive, while the other cells die). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 
culture, can be used to produce (i.e., express) a TLCC-4 or TLCC-5 polypeptide. 
Accordingly, the invention further provides methods for producing a TLCC-4 or TLCC- 
5 polypeptide using the host cells of the invention. In one embodiment, the method 

35 comprises culturing the host cell of the invention (into which a recombinant expression 
vector encoding a TLCC-4 or TLCC-5 polypeptide has been introduced) in a suitable 
medium such that a TLCC-4 or TLCC-5 polypeptide is produced. In another 
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embodiment, the method further comprises isolating a TLCC-4 or TLCC-5 polypeptide 
from the medium or the host cell. 

The host cells of the invention can also be used to produce non-human transgenic 
animals. For example, in one embodiment, a host cell of the invention is a fertilized 
5 oocyte or an embryonic stem cell into which TLCC-4-coding sequences or TLCC-5- 
coding sequences have been introduced. Such host cells can then be used to create non- 
human transgenic animals in which exogenous TLCC-4 or TLCC-5 sequences have 
been introduced into their genome or homologous recombinant animals in which 
endogenous TLCC-4 or TLCC-5 sequences have been altered. Such animals are useful 

10 for studying the function and/or activity of a TLCC-4 or TLCC-5 and for identifying 
and/or evaluating modulators of TLCC-4 or TLCC-5 activity. As used herein, a 
"transgenic animal" is a non-human animal, preferably a mammal, more preferably a 
rodent such as a rat or mouse, in which one or more of the cells of the animal includes a 
transgene. Other examples of transgenic animals include non-human primates, sheep, 

15 dogs, cows, goats, chickens, amphibians, and the like. A transgene is exogenous DNA 
which is integrated into the genome of a cell from which a transgenic animal develops 
and which remains in the genome of the mature animal, thereby directing the expression 
of an encoded gene product in one or more cell types or tissues of the transgenic animal. 
As used herein, a "homologous recombinant animal" is a non-human animal, preferably 

20 a mammal, more preferably a mouse, in which an endogenous TLCC-4 or TLCC-5 gene 
has been altered by homologous recombination between the endogenous gene and an 
exogenous DNA molecule introduced into a cell of the animal, e.g., an embryonic cell of 
the animal, prior to development of the animal. 

A transgenic animal of the invention can be created by introducing a TLCC-4- 

25 encoding nucleic acid or TLCC-5 -encoding nucleic acid into the male pronuclei of a 
fertilized oocyte, e.g., by microinjection, retroviral infection, and allowing the oocyte to 
develop in a pseudopregnant female foster animal. The TLCC-4 or TLCC-5 cDNA 
sequence of SEQ ID NO:l or 4 can be introduced as a transgene into the genome of a 
non-human animal. Alternatively, a nonhuman homologue of a human TLCC-4 or 

30 TLCC-5 gene, such as a mouse or rat TLCC-4 or TLCC-5 gene, can be used as a 
transgene. Alternatively, a TLCC-4 or TLCC-5 gene homologue, such as another 
TLCC-4 or TLCC-5 family member, can be isolated based on hybridization to the 
TLCC-4 or TLCC-5 cDNA sequences of SEQ ID NO:l, 3, 4, or 6, or the DNA inserts of 
the plasmids deposited with ATCC as Accession Numbers (described further in 

35 subsection I above) and used as a transgene. Intronic sequences and polyadenylation 
signals can also be included in the transgene to increase the efficiency of expression of 
the transgene. A tissue-specific regulatory sequence(s) can be operably linked to a 
TLCC-4 or TLCC-5 transgene to direct expression of a TLCC-4 or TLCC-5 polypeptide 
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to particular cells. Methods for generating transgenic animals via embryo manipulation 
and microinjection, particularly animals such as mice, have become conventional in the 
art and are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009, both by 
Leder et al, U.S. Patent No. 4,873,191 by Wagner et al and in Hogan, B., Manipulating 
5 the Mouse Embryo, (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 
1 986). Similar methods are used for production of other transgenic animals. A 
transgenic founder animal can be identified based upon the presence of a TLCC-4 or 
TLCC-5 transgene in its genome and/or expression of TLCC-4 or TLCC-5 mRNA in 
tissues or cells of the animals. A transgenic founder animal can then be used to breed 

1 0 additional animals carrying the transgene. Moreover, transgenic animals carrying a 
transgene encoding a TLCC-4 or TLCC-5 polypeptide can further be bred to other 
transgenic animals carrying other transgenes. 

To create a homologous recombinant animal, a vector is prepared which contains 
at least a portion of a TLCC-4 or TLCC-5 gene into which a deletion, addition or 

15 substitution has been introduced to thereby alter, e.g., functionally disrupt, the TLCC-4 
or TLCC-5 gene. The TLCC-4 or TLCC-5 gene can be a human gene (e.g., the cDNA 
of SEQ ID NO: 3 or 6), but more preferably, is a non-human homologue of a human 
TLCC-4 or TLCC-5 gene {e.g., a cDNA isolated by stringent hybridization with the 
nucleotide sequence of SEQ ID NO:l or 4). For example, a mouse TLCC-4 or TLCC r 5 

20 gene can be used to construct a homologous recombination nucleic acid molecule, e.g., a 
vector, suitable for altering an endogenous TLCC-4 or TLCC-5 gene in the mouse 
genome. In a preferred embodiment, the homologous recombination nucleic acid 
molecule is designed such that, upon homologous recombination, the endogenous 
TLCC-4 or TLCC-5 gene is functionally disrupted (i.e., no longer encodes a functional 

25 polypeptide; also referred to as a "knock out" vector). Alternatively, the homologous 
recombination nucleic acid molecule can be designed such that, upon homologous 
recombination, the endogenous TLCC-4 or TLCC-5 gene is mutated or otherwise 
altered but still encodes functional polypeptide (e.g., the upstream regulatory region can 
be altered to thereby alter the expression of the endogenous TLCC-4 or TLCC-5 

30 polypeptide). In the homologous recombination nucleic acid molecule, the altered 
portion of the TLCC-4 or TLCC-5 gene is flanked at its 5' and 3 f ends by additional 
nucleic acid sequence of the TLCC-4 or TLCC-5 gene to allow for homologous 
recombination to occur between the exogenous TLCC-4 or TLCC-5 gene carried by the 
homologous recombination nucleic acid molecule and an endogenous TLCC-4 or 

35 TLCC-5 gene in a cell, e.g., an embryonic stem cell. The additional flanking TLCC-4 or 
TLCC-5 nucleic acid sequence is of sufficient length for successful homologous 
recombination with the endogenous gene. Typically, several kilobases of flanking DNA 
(both at the 5' and 3 f ends) are included in the homologous recombination nucleic acid 
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molecule (see, e.g., Thomas, K.R. and Capecchi, M. R. (1987) Cell 51:503 for a 
description of homologous recombination vectors). The homologous recombination 
nucleic acid molecule is introduced into a cell, e.g., an embryonic stem cell line (e.g. , by 
electroporation) and cells in which the introduced TLCC-4 or TLCC-5 gene has 
5 homologously recombined with the endogenous TLCC-4 or TLCC-5 gene are selected 
(see e.g., Li, E. et ah (1992) Cell 69:915). The selected cells can then injected into a 
blastocyst of an animal (e.g. , a mouse) to form aggregation chimeras (see e.g. , Bradley, 
A. in Teratocarcinomas and Embryonic Stem Cells: A Practical Approach, E.J. 
Robertson, ed. (IRL, Oxford, 1987) pp. 1 13-152). A chimeric embryo can then be 

10 implanted into a suitable pseudopregnant female foster animal and the embryo brought 
to term. Progeny harboring the homologously recombined DNA in their germ cells can 
be used to breed animals in which all cells of the animal contain the homologously 
recombined DNA by germline transmission of the transgene. Methods for constructing 
homologous recombination nucleic acid molecules, e.g., vectors, or homologous 

15 recombinant animals are described further in Bradley, A. (1991) Current Opinion in 
Biotechnology 2:823-829 and in PCT International Publication Nos.: WO 90/1 1354 by 
Le Mouellec et ah; WO 91/01 140 by Smithies et ah; WO 92/0968 by Zijlstra et ah; and 
WO 93/04169 by Bems et ah 

In another embodiment, transgenic non-human animals can be produced which 

20 contain selected systems which allow for regulated expression of the transgene. One 
example of such a system is the cre/loxP recombinase system of bacteriophage PI . For 
a description of the cre/loxP recombinase system, see, e.g., Lakso et ah (1992) Proc. .. 
Natl. Acad. Sci. USA 89:6232-6236. Another example of a recombinase system is the 
FLP recombinase system of Saccharomyces cerevisiae (O'Gorman et ah (1991) Science 

25 251:1351-1355. If a cre/loxP recombinase system is used to regulate expression of the 
transgene, animals containing transgenes encoding both the Cre recombinase and a 
selected protein are required. Such animals can be provided through the construction of 
"double" transgenic animals, e.g., by mating two transgenic animals, one containing a 
transgene encoding a selected protein and the other containing a transgene encoding a 

30 recombinase. 

Clones of the non-human transgenic animals described herein can also be 

produced according to the methods described in Wilmut, I. et ah (1997) Nature 

385:810-813 and PCT International Publication Nos. WO 97/07668 and WO 97/07669. 

In brief, a cell, e.g., a somatic cell, from the transgenic animal can be isolated and 
35 induced to exit the growth cycle and enter G Q phase. The quiescent cell can then be 

fused, e.g., through the use of electrical pulses, to an enucleated oocyte from an animal 
of the same species from which the quiescent cell is isolated. The reconstructed oocyte 
is then cultured such that it develops to morula or blastocyte and then transferred to 
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pseudopregnant female foster animal. The offspring borne of this female foster animal 
will be a clone of the animal from which the cell, e.g., the somatic cell, is isolated. 

IV. Pharmaceutical Compositions 
5 The TLCC-4 and TLCC-5 nucleic acid molecules, fragments of TLCC-4 and 

TLCC-5 polypeptides, and anti-TLCC-4 and anti-TLCC-5 antibodies (also referred to 
herein as "active compounds") of the invention can be incorporated into pharmaceutical 
compositions suitable for administration. Such compositions typically comprise the 
nucleic acid molecule, polypeptide, or antibody and a pharmaceutical^ acceptable 

10 carrier. As used herein the language "pharmaceutical^ acceptable carrier" is intended to 
include any and all solvents, dispersion media, coatings, antibacterial and antifungal 
agents, isotonic and absorption delaying agents, and the like, compatible with 
pharmaceutical administration. The use of such media and agents for pharmaceutical^ 
active substances is well known in the art. Except insofar as any conventional media or 

15 agent is incompatible with the active compound, use thereof in the compositions is 
contemplated. Supplementary active compounds can also be incorporated into the 
compositions. 

A pharmaceutical composition of the invention is formulated to be compatible 
with its intended route of administration. Examples of routes of administration include 

20 parenteral, e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation), 

transdermal (topical), transmucosal, and rectal administration. Solutions or suspensions 
used for parenteral, intradermal, or subcutaneous application can include the following 
components: a sterile diluent such as water for injection, saline solution, fixed oils, 
polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; 

25 antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as 
ascorbic acid or sodium bisulfite; chelating agents such as ethylenediaminetetraacetic 
acid; buffers such as acetates, citrates or phosphates and agents for the adjustment of 
tonicity such as sodium chloride or dextrose. pH can be adjusted with acids or bases, 
such as hydrochloric acid or sodium hydroxide. The parenteral preparation can be 

30 enclosed in ampoules, disposable syringes or multiple dose vials made of glass or 
plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the 
extemporaneous preparation of sterile injectable solutions or dispersion. For 
35 intravenous administration, suitable carriers include physiological saline, bacteriostatic 
water, Cremophor EL™ (BASF, Parsippany, NJ) or phosphate buffered saline (PBS). 
In all cases, the composition must be sterile and should be fluid to the extent that easy 
syringability exists. It must be stable under the conditions of manufacture and storage 
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and must be preserved against the contaminating action of microorganisms such as 
bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for 
example, water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid 
polyetheylene glycol, and the like), and suitable mixtures thereof. The proper fluidity 
5 can be maintained, for example, by the use of a coating such as lecithin, by the 
maintenance of the required particle size in the case of dispersion and by the use of 
surfactants. Prevention of the action of microorganisms can be achieved by various 
antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, 
ascorbic acid, thimerosal, and the like. In many cases, it will be preferable to include 

10 isotonic agents, for example, sugars, polyalcohols such as manitol, sorbitol, sodium 

chloride in the composition. Prolonged absorption of the injectable compositions can be 
brought about by including in the composition an agent which delays absorption, for 
example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active 

15 compound (e.g., a fragment of a TLCC-4 or TLCC-5 polypeptide or an anti-TLCC-4 or 
anti-TLCC-5 antibody) in the required amount in an appropriate solvent with one or a 
combination of ingredients enumerated above, as required, followed by filtered 
sterilization. Generally, dispersions are prepared by incorporating the active compound 
into a sterile vehicle which contains a basic dispersion medium and the required other 

20 ingredients from those enumerated above. In the case of sterile powders for the . 
preparation of sterile injectable solutions, the preferred methods of preparation are 
vacuum drying and freeze-drying which yields a powder of the active ingredient plus 
any additional desired ingredient from a previously sterile-filtered solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier. They 

25 can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and 
used in the form of tablets, troches, or capsules. Oral compositions can also be prepared 
using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is 
applied orally and swished and expectorated or swallowed. Pharmaceutically 

30 compatible binding agents, and/or adjuvant materials can be included as part of the 
composition. The tablets, pills, capsules, troches and the like can contain any of the 
following ingredients, or compounds of a similar nature: a binder such as 
microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or 
lactose, a disintegrating agent such as alginic acid, Primogel, or com starch; a lubricant 

35 such as magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a 
sweetening agent such as sucrose or saccharin; or a flavoring agent such as peppermint, 
methyl salicylate, or orange flavoring. 
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For administration by inhalation, the compounds are delivered in the form of an 
aerosol spray from pressured container or dispenser which contains a suitable propellant, 
e.g., a gas such as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. For 
5 transmucosal or transdermal administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art, 
and include, for example, for transmucosal administration, detergents, bile salts, and 
fiisidic acid derivatives. Transmucosal administration can be accomplished through the 
use of nasal sprays or suppositories. For transdermal administration, the active 
10 compounds are formulated into ointments, salves, gels, or creams as generally known in 
the art. 

The compounds can also be prepared in the form of suppositories (e.g., with 
conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

15 In one embodiment, the active compounds are prepared with carriers that will 

protect the compound against rapid elimination from the body, such as a controlled 
release formulation, including implants and microencapsulated delivery systems. 
Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, 
poly anhydrides, polygiycolic acid, collagen, polyorthoesters, and polylactic acid. 

20 Methods for preparation of such formulations will be apparent to those skilled in the art. 
The materials can also be obtained commercially from Alza Corporation and Nova 
Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to infected 
cells with monoclonal antibodies to viral antigens) can also be used as pharmaceutical^ 
acceptable carriers. These can be prepared according to methods known to those skilled 

25 in the art, for example, as described in U.S. Patent No. 4,522,8 1 1 . 

It is especially advantageous to formulate oral or parenteral compositions in 
dosage unit form for ease of administration and uniformity of dosage. Dosage unit form 
as used herein refers to physically discrete units suited as unitary dosages for the subject 
to be treated; each unit containing a predetermined quantity of active compound 

30 calculated to produce the desired therapeutic effect in association with the required 

pharmaceutical carrier. The specification for the dosage unit forms of the invention are 
dictated by and directly dependent on the unique characteristics of the active compound 
and the particular therapeutic effect to be achieved, and the limitations inherent in the art 
of compounding such an active compound for the treatment of individuals. 

35 Toxicity and therapeutic efficacy of such compounds can be determined by 

standard pharmaceutical procedures in cell cultures or experimental animals, e.g., for 
determining the LD50 (the dose lethal to 50% of the population) and the ED50 (the dose 
therapeutically effective in 50% of the population). The dose ratio between toxic and 



49 



WO 02/00722 



PCT/US01/20640 



therapeutic effects is the therapeutic index and it can be expressed as the ratio 
LD50/ED50. Compounds which exhibit large therapeutic indices are preferred. While 
compounds that exhibit toxic side effects may be used, care should be taken to design a 
delivery system that targets such compounds to the site of affected tissue in order to 
5 minimize potential damage to uninfected cells and, thereby, reduce side effects. 

The data obtained from the cell culture assays and animal studies can be used in 
formulating a range of dosage for use in humans. The dosage of such compounds lies 
preferably within a range of circulating concentrations that include the ED50 with little 
or no toxicity. The dosage may vary within this range depending upon the dosage form 

10 employed and the route of administration utilized. For any compound used in the 
method of the invention, the therapeutically effective dose can be estimated initially 
from cell culture assays. A dose may be formulated in animal models to achieve a 
circulating plasma concentration range that includes the IC50 (i.e., the concentration of 
the test compound which achieves a half-maximal inhibition of symptoms) as 

15 determined in cell culture. Such information can be used to more accurately determine 
useful doses in humans. Levels in plasma may be measured, for example, by high 
performance liquid chromatography. 

As defined herein, a therapeutically effective amount of polypeptide (i.e., an 
effective dosage) ranges from about 0.001 to 30 mg/kg body weight, preferably about 

20 0.01 to 25 mg/kg body weight, more preferably about 0. 1 to 20 mg/kg body weight, and 
even more preferably about 1 to 10 mg/kg, 2 to 9 mg/kg, 3 to 8 mg/kg, 4 to 7 mg/kg, or 
5 to 6 mg/kg body weight. The skilled artisan will appreciate that certain factors may 
influence the dosage required to effectively treat a subject, including but not limited to 
the severity of the disease or disorder, previous treatments, the general health and/or age 

25 of the subject, and other diseases present. Moreover, treatment of a subject with a 
therapeutically effective amount of a polypeptide or antibody can include a single 
treatment or, preferably, can include a series of treatments. 

In a preferred example, a subject is treated with antibody or polypeptide in the 
range of between about 0.1 to 20 mg/kg body weight, one time per week for between 

30 about 1 to 10 weeks, preferably between about 2 to 8 weeks, more preferably between 
about 3 to 7 weeks, and even more preferably for about 4, 5, or 6 weeks. It will also be 
appreciated that the effective dosage of antibody or polypeptide used for treatment may 
increase or decrease over the course of a particular treatment. Changes in dosage may 
result and become apparent from the results of diagnostic assays as described herein. 

35 The present invention encompasses agents which modulate expression or 

activity. An agent may, for example, be a small molecule. For example, such small 
molecules include, but are not limited to, peptides, peptidomimetics, amino acids, amino 
acid analogs, polynucleotides, polynucleotide analogs, nucleotides, nucleotide analogs, 
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organic or inorganic compounds including heteroorganic and organometallic 
compounds) having a molecular weight less than about 1 0,000 grams per mole, organic 
or inorganic compounds having a molecular weight less than about 5,000 grams per 
mole, organic or inorganic compounds having a molecular weight less than about 1,000 
5 grams per mole, organic or inorganic compounds having a molecular weight less than 
about 500 grams per mole, and salts, esters, and other pharmaceutically acceptable 
forms of such compounds. It is understood that appropriate doses of small molecule 
agents depends upon a number of factors within the ken of the ordinarily skilled 
physician, veterinarian, or researcher. The dose(s) of the small molecule will vary, for 

10 example, depending upon the identity, size, and condition of the subject or sample being 
treated, further depending upon the route by which the composition is to be 
administered, if applicable, and the effect which the practitioner desires the small 
molecule to have upon the nucleic acid or polypeptide of the invention. 

Exemplary doses include milligram or microgram amounts of the small molecule 

15 per kilogram of subject or sample weight (e.g., about 1 microgram per kilogram to about 
500 milligrams per kilogram, about 100 micrograms per kilogram to about 5 milligrams 
per kilogram, or about 1 microgram per kilogram to about 50 micrograms per kilogram. 
It is 

furthermore understood that appropriate doses of a small molecule depend upon the 
20 potency of the small molecule with respect to the expression or activity to be modulated. 
Such appropriate doses may be determined using the assays described herein. When one 
or more of these small molecules is to be administered to an animal (e.g., a human) in 
order to modulate expression or activity of a polypeptide or nucleic acid of the 
invention, a physician, veterinarian, or researcher may, for example, prescribe a 
25 relatively low dose at first, subsequently increasing the dose until an appropriate 

response is obtained. In addition, it is understood that the specific dose level for any 
particular animal subject will depend upon a variety of factors including the activity of 
the specific compound employed, the age, body weight, general health, gender, and diet 
of the subject, the time of administration, the route of administration, the rate of 
30 excretion, any drug combination, and the degree of expression or activity to be 
modulated. 

Further, an antibody (or fragment thereof) may be conjugated to a therapeutic 
moiety such as a cytotoxin, a therapeutic agent or a radioactive metal ion. A cytotoxin 
or cytotoxic agent includes any agent that is detrimental to cells. Examples include 
35 taxol, cytochalasin B, gramicidin D, ethidium bromide, emetine, mitomycin, etoposide, 
tenoposide, vincristine, vinblastine, colchicin, doxorubicin, daunorubicin, dihydroxy 
anthracin dione, mitoxantrone, mithramycin, actinomycin D, 1-dehydrotestosterone, 
glucocorticoids, procaine, tetracaine, lidocaine, propranolol, and puromycin and analogs 
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or homologues thereof. Therapeutic agents include, but are not limited to, 
antimetabolites (e.g., methotrexate, 6-mercaptopurine, 6-thioguanine, cytarabine, 5- 
fluorouracil decarbazine), alkylating agents (e.g., mechlorethamine, thioepa 
chlorambucil, melphalan, carmustine (BSNU) and lomustine (CCNU), 
5 cyclothosphamide, busulfan, dibromomannitol, streptozotocin, mitomycin C, and cis- 
dichlorodiamine platinum (II) (DDP) cisplatin), anthracyclines (e.g., daunorubicin 
(formerly daunomycin) and doxorubicin), antibiotics (e.g., dactinomycin (formerly 
actinomycin), bleomycin, mithramycin, and anthramycin (AMC)), and anti-mitotic 
agents (e.g., vincristine and vinblastine). 

10 The conjugates of the invention can be used for modifying a given biological 

response, the drug moiety is not to be construed as limited to classical chemical 
therapeutic agents. For example, the drug moiety may be a polypeptide possessing a 
desired biological activity. Such polypeptides may include, for example, a toxin such as 
abrin, ricin A, pseudonaonas exotoxin, or diphtheria toxin; a protein such as tumor 

15 necrosis factor, alpha-interferon, beta-interferon, nerve growth factor, platelet derived 
growth factor, tissue plasminogen activator; or, biological response modifiers such as, 
for example, lymphokines, interleukin-1 ("IL-1"), interleukin-2 ("IL-2"), interleukin-6 
("IL-6"), granulocyte macrophage colony stimulating factor ("GM-CSF"), granulocyte 
colony stimulating factor ("G-CSF"), or other growth factors. 

20 Techniques for conjugating such therapeutic moiety to antibodies are well 

known, see, e.g., Arnon et ah, "Monoclonal Antibodies For Immunotargeting Of Drugs 
In Cancer Therapy", in Monoclonal Antibodies And Cancer Therapy, Reisfeld et ah 
(eds.), pp. 243-56 (Alan R. Liss, Inc. 1985); Hellstrom et ah, "Antibodies For Drug 
Delivery", in Controlled Drug Delivery (2nd Ed.), Robinson et ah (eds.), pp. 623-53 

25 (Marcel Dekker, Inc. 1987); Thorpe, "Antibody Carriers Of Cytotoxic Agents In Cancer 
Therapy: A Review", in Monoclonal Antibodies '84: Biological And Clinical 
Applications, Pinchera et ah (eds.), pp. 475-506 (1985); "Analysis, Results, And Future 
Prospective Of The Therapeutic Use Of Radiolabeled Antibody In Cancer Therapy", in 
Monoclonal Antibodies For Cancer Detection And Therapy, Baldwin et ah (eds.), pp. 

30 303-16 (Academic Press 1985), and Thorpe et ah, "The Preparation And Cytotoxic 
Properties Of Antibody-Toxin Conjugates", Immunol. Rev., 62:1 19-58 (1982). 
Alternatively, an antibody can be conjugated to a second antibody to form an antibody 
heteroconjugate as described by Segal in U.S. Patent No. 4,676,980. 

The nucleic acid molecules of the invention can be inserted into vectors and used 

35 as gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for 

example, intravenous injection, local administration (see, e.g., U.S. Patent 5,328,470), or 

by stereotactic injection (see, e.g., Chen et ah (1994) Proc. Natl. Acad. Sci. USA 

91 :3054-3057). The pharmaceutical preparation of the gene therapy vector can include 
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the gene therapy vector in an acceptable diluent, or can comprise a slow release matrix 
in which the gene delivery vehicle is imbedded. Alternatively, where the complete gene 
delivery vector can be produced intact from recombinant cells, e.g., retroviral vectors, 
the pharmaceutical preparation can include one or more cells which produce the gene 
5 delivery system. 

The pharmaceutical compositions can be included in a container, pack, or 
dispenser together with instructions for administration. 

V. Uses and Methods of the Invention 

10 The nucleic acid molecules, polypeptides, polypeptide homologues, and 

antibodies described herein can be used in one or more of the following methods: a) 
screening assays; b) predictive medicine (e.g., diagnostic assays, prognostic assays, 
monitoring clinical trials, and pharmacogenetics); and c) methods of treatment (e.g., 
therapeutic and prophylactic). As described herein, a TLCC-4 or TLCC-5 polypeptide 

15 of the invention has one or more of the following activities: (1) modulates membrane 
excitability, (2) influences the resting potential of membranes, (3) modulates wave 
forms and frequencies of action potentials, (4) modulates thresholds of excitation, (5) 
modulates neurite outgrowth and synaptogenesis, (6) modulates signal transduction, and 
(7) participates in nociception. 

20 The isolated nucleic acid molecules of the invention can be used, for example, to 

express a TLCC-4 or TLCC-5 polypeptide (e.g., via a recombinant expression vector in 
a host cell in gene therapy applications), to detect TLCC-4 or TLCC-5 mRNA (e.g., in a 
biological sample) or a genetic alteration in a TLCC-4 or TLCC-5 gene, and to modulate 
TLCC-4 or TLCC-5 activity, as described further below. The TLCC-4 or TLCC-5 

25 polypeptides can be used to treat disorders characterized by insufficient or excessive 
production of a TLCC-4 or TLCC-5 substrate or production of TLCC-4 or TLCC-5 
inhibitors. In addition, the TLCC-4 or TLCC-5 polypeptides can be used to screen for 
naturally occurring TLCC-4 or TLCC-5 substrates, to screen for drugs or compounds 
which modulate TLCC-4 or TLCC-5 activity, as well as to treat disorders characterized 

30 by insufficient or excessive production of TLCC-4 or TLCC-5 polypeptide or 

production of TLCC-4 or TLCC-5 polypeptide forms which have decreased, aberrant or 
unwanted activity compared to TLCC-4 or TLCC-5 wild type polypeptide (e.g., CNS 
disorders (such as neurodegenerative disorders), pain disorders, or cellular growth, 
differentiation, or migration disorders). Moreover, the anti-TLCC-4 or anti-TLCC-5 

35 antibodies of the invention can be used to detect and isolate TLCC-4 or TLCC-5 

polypeptides, to regulate the bioavailability of TLCC-4 or TLCC-5 polypeptides, and 
modulate TLCC-4 or TLCC-5 activity. 
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A. Screening Assays : 

The invention provides a method (also referred to herein as a "screening assay") 
for identifying modulators, i.e. 9 candidate or test compounds or agents (e.g., peptides, 
peptidomimetics, small molecules or other drugs) which bind to TLCC-4 or TLCC-5 
5 polypeptides, have a stimulatory or inhibitory effect on, for example, TLCC-4 or TLCC- 
5 expression or TLCC-4 or TLCC-5 activity, or have a stimulatory or inhibitory effect 
on, for example, the expression or activity of TLCC-4 or TLCC-5 substrate. 

In one embodiment, the invention provides assays for screening candidate or test 
compounds which are substrates of a TLCC-4 or TLCC-5 polypeptide or biologically 

10 active portion thereof. In another embodiment, the invention provides assays for 
screening candidate or test compounds which bind to or modulate the activity of a 
TLCC-4 or TLCC-5 polypeptide or polypeptide or biologically active portion thereof. 

The screening assays of the present invention include, but are not limited to, 
screening assays specific for calcium receptors, e.g., measuring intracellular Ca 2+ 

15 concentrations, and/or assaying responses to the presence of the calcium channel 
stimulators inositol- 1,4,5 -triphosphate (IP 3 ) and/or diacylglycerol (DAG) (see, e.g., 
Hofmann et al. (1999) Nature 397:259-263; Okadae/a/. (1999) J. Biol Chem. 
274:27359-27370; Hofmann et al (2000) J. Mol Med. 78:14-25). Intracellular Ca 2+ 
levels may be assayed, for example, using the fluorescent dye fura-2 as an indicator. 

20 Stimulation by DAG may be assayed, for example, using DAG analogues (e.g., OAG 
(l-oleoyl-2-acetyl-S77-glycerol) and DOG (1,2-dioctanoyl -^-glycerol)). IP3 stimulation 
may be analyzed, for example, using the PLC inhibitor U-73 122 in the presence of ATP, 
which suppresses IP3-dependent Ca 2+ release. 

The test compounds of the present invention can be obtained using any of the 

25 numerous approaches in combinatorial library methods known in the art, including: 

biological libraries; spatially addressable parallel solid phase or solution phase libraries; 
synthetic library methods requiring deconvolution; the ! one-bead one-compound' library 
method; and synthetic library methods using affinity chromatography selection. The 
biological library approach is limited to peptide libraries, while the other four 

30 approaches are applicable to peptide, non-peptide oligomer or small molecule libraries 
of compounds (Lam, K.S. (1997) Anticancer Drug Des. 12:145). 

Examples of methods for the synthesis of molecular libraries can be found in the 
art, for example in: DeWitt et al (1993) Proc. Natl. Acad. Set U.S.A. 90:6909; Erb et 
al (1994) Proc. Natl Acad. Sci. USA 91 :1 1422; Zuckermann et al (1994). J. Med. 

35 Chem. 37:2678; Cho et al (1993) Science 261 :1303; Carrell et al. (1994) Angew. Chem. 
Int. Ed. Engl 33:2059; Carell et al. (1994) Angew. Chem. Int. Ed. Engl 33:2061; and in 
Gallop et al. (1994)./ Med. Chem. 37:1233. 
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Libraries of compounds may be presented in solution (e.g., Houghten (1992) 
Biotechniques 13:412-421), or on beads (Lam (1991) Nature 354:82-84), chips (Fodor 
(1993) Nature 364:555-556), bacteria (Ladner USP 5,223,409), spores (Ladner USP 
'409), plasmids (Cull et al. (1992) Proc Natl Acad Sci USA 89:1865-1869) or on phage 
5 (Scott and Smith (1 990) Science 249:386-390); (Devlin (1 990) Science 249:404-406); 
(Cwirla et al (1990) Proc. Natl. Acad. Sci. 87:6378-6382); (Felici (1991) J. Mol. Biol. 
222:301-310); (Ladner supra.). 

In one embodiment, an assay is a cell-based assay in which a cell which 
expresses a TLCC-4 or TLCC-5 polypeptide or biologically active portion thereof is 

10 contacted with a test compound and the ability of the test compound to modulate TLCC- 
4 or TLCC-5 activity is determined. Determining the ability of the test compound to 
modulate TLCC-4 or TLCC-5 activity can be accomplished by monitoring, for example, 
intracellular calcium, IP3, or diacylglycerol concentration, phosphorylation profile of 
intracellular proteins, or the activity of a TLCC-4-regulated or TLCC-5-regulated 

15 transcription factor. The cell, for example, can be of mammalian origin, e.g. , a neuronal 
cell, skin cell, or a liver cell. 

The ability of the test compound to modulate TLCC-4 or TLCC-5 binding to a 
substrate or to bind to TLCC-4 or TLCC-5 can also be determined. Determining the 
ability of the test compound to modulate TLCC-4 or TLCC-5 binding to a substrate can 

20 be accomplished, for example, by coupling the TLCC-4 or TLCC-5 substrate with a 

radioisotope or enzymatic label such that binding of the TLCC-4 or TLCC-5 substrate to 
TLCC-4 or TLCC-5 can be determined by detecting the labeled TLCC-4 or TLCC-5 
substrate in a complex. Alternatively, TLCC-4 or TLCC-5 could be coupled with a 
radioisotope or enzymatic label to monitor the ability of a test compound to modulate 

25 TLCC-4 or TLCC-5 binding to a TLCC-4 or TLCC-5 substrate in a complex. 
Determining the ability of the test compound to bind TLCC-4 or TLCC-5 can be 
accomplished, for example, by coupling the compound with a radioisotope or enzymatic 
label such that binding of the compound to TLCC-4 or TLCC-5 can be determined by 
detecting the labeled TLCC-4 or TLCC-5 compound in a complex. For example, 

30 compounds (e.g., TLCC-4 or TLCC-5 substrates) can be labeled with 125 I, 35 S, 14 C, or 
^H, either directly or indirectly, and the radioisotope detected by direct counting of 
radioemmission or by scintillation counting. Alternatively, compounds can be 
enzymatically labeled with, for example, horseradish peroxidase, alkaline phosphatase, 
or luciferase, and the enzymatic label detected by determination of conversion of an 

35 appropriate substrate to product. 

It is also within the scope of this invention to determine the ability of a 
compound (e.g., a TLCC-4 or TLCC-5 substrate) to interact with TLCC-4 or TLCC-5 
without the labeling of any of the interactants. For example, a microphysiometer can be 



WO 02/00722 



PCT/US01/20640 



used to detect the interaction of a compound with TLCC-4 or TLCC-5 without the 
labeling of either the compound or the TLCC-4 or TLCC-5. McConnell, H. M. et ah 
(1992) Science 257:1906-1912. As used herein, a "microphysiometer" (e.g., 
Cytosensor) is an analytical instrument that measures the rate at which a cell acidifies its 
5 environment using a light-addressable potentiometric sensor (LAPS). Changes in this 
acidification rate can be used as an indicator of the interaction between a compound and 
TLCC-4 or TLCC-5. 

In another embodiment, an assay is a cell-based assay comprising contacting a 
cell expressing a TLCC-4 or TLCC-5 target molecule (e.g., a TLCC-4 or TLCC-5 

10 substrate) with a test compound and determining the ability of the test compound to 
modulate (e.g., stimulate or inhibit) the activity of the TLCC-4 or TLCC-5 target 
molecule. Determining the ability of the test compound to modulate the activity of a 
TLCC-4 or TLCC-5 target molecule can be accomplished, for example, by determining 
the ability of the TLCC-4 or TLCC-5 polypeptide to bind to or interact with the TLCC-4 

1 5 or TLCC-5 target molecule. 

Determining the ability of the TLCC-4 or TLCC-5 polypeptide, or a biologically 
active fragment thereof, to bind to or interact with a TLCC-4 or TLCC-5 target molecule 
can be accomplished by one of the methods described above for determining direct 
binding. In a preferred embodiment, determining the ability of the TLCC-4 or TLCC-5 

20 polypeptide to bind to or interact with a TLCC-4 or TLCC-5 target molecule can be 
accomplished by determining the activity of the target molecule. For example, the 
activity of the target molecule can be determined by detecting induction of a cellular 

2+ 

second messenger of the target (i.e., intracellular Ca , diacylglycerol, IP 3 , and the like), 
detecting catalytic/enzymatic activity of the target using an appropriate substrate, 

25 detecting the induction of a reporter gene (comprising a target-responsive regulatory 
element operatively linked to a nucleic acid encoding a detectable marker, e.g., 
luciferase), or detecting a target-regulated cellular response. 

In yet another embodiment, an assay of the present invention is a cell-free assay 
in which a TLCC-4 or TLCC-5 polypeptide or biologically active portion thereof is 

30 contacted with a test compound and the ability of the test compound to bind to the 
TLCC-4 or TLCC-5 polypeptide or biologically active portion thereof is determined. 
Preferred biologically active portions of the TLCC-4 or TLCC-5 polypeptides to be used 
in assays of the present invention include fragments which participate in interactions 
with non-TLCC-4 or non-TLCC-5 molecules, e.g., fragments with high surface 

35 probability scores (see, for example, Figures 4 and 7). Binding of the test compound to 
the TLCC-4 or TLCC-5 polypeptide can be determined either directly or indirectly as 
described above. In a preferred embodiment, the assay includes contacting the TLCC-4 
or TLCC-5 polypeptide or biologically active portion thereof with a known compound 
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which binds TLCC-4 or TLCC-5 to form an assay mixture, contacting the assay mixture 
with a test compound, and determining the ability of the test compound to interact with a 
TLCC-4 or TLCC-5 polypeptide, wherein determining the ability of the test compound 
to interact with a TLCC-4 or TLCC-5 polypeptide comprises determining the ability of 
5 the test compound to preferentially bind to TLCC-4 or TLCC-5 or biologically active 
portion thereof as compared to the known compound. 

In another embodiment, the assay is a cell-free assay in which a TLCC-4 or 
TLCC-5 polypeptide or biologically active portion thereof is contacted with a test 
compound and the ability of the test compound to modulate (e.g., stimulate or inhibit) 

10 the activity of the TLCC-4 or TLCC-5 polypeptide or biologically active portion thereof 
is determined. Determining the ability of the test compound to modulate the activity of 
a TLCC-4 or TLCC-5 polypeptide can be accomplished, for example, by determining 
the ability of the TLCC-4 or TLCC-5 polypeptide to bind to a TLCC-4 or TLCC-5 target 
molecule by one of the methods described above for determining direct binding. 

15 Determining the ability of the TLCC-4 or TLCC-5 polypeptide to bind to a TLCC-4 or 
TLCC-5 target molecule can also be accomplished using a technology such as real-time 
Biomolecular Interaction Analysis (BIA). Sjolander, S. and Urbaniczky, C. (1991) 
Anal. Chem. 63:2338-2345 and Szabo etal (1995) Curr. Opin. Struct. Biol. 5:699-705. 
As used herein, "BIA" is a technology for studying biospecific interactions, in real time, 

20 without labeling any of the interactants (e.g., BIAcore). Changes in the optical 

phenomenon of surface plasmon resonance (SPR) can be used as an indication of real- 
time reactions between biological molecules. 

In an alternative embodiment, determining the ability of the test compound to 
modulate the activity of a TLCC-4 or TLCC-5 polypeptide can be accomplished by 

25 determining the ability of the TLCC-4 or TLCC-5 polypeptide to further modulate the 
activity of a downstream effector of a TLCC-4 or TLCC-5 target molecule. For 
example, the activity of the effector molecule on an appropriate target can be determined 
or the binding of the effector to an appropriate target can be determined as previously 
described. 

30 In yet another embodiment, the cell-free assay involves contacting a TLCC-4 or 

TLCC-5 polypeptide or biologically active portion thereof with a known compound 
which binds the TLCC-4 or TLCC-5 polypeptide to form an assay mixture, contacting 
the assay mixture with a test compound, and determining the ability of the test 
compound to interact with the TLCC-4 or TLCC-5 polypeptide, wherein determining 

35 the ability of the test compound to interact with the TLCC-4 or TLCC-5 polypeptide 
comprises determining the ability of the TLCC-4 or TLCC-5 polypeptide to 
preferentially bind to or modulate the activity of a TLCC-4 or TLCC-5 target molecule. 
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In more than one embodiment of the above assay methods of the present 
invention, it may be desirable to immobilize either TLCC-4 or TLCC-5 or its target 
molecule to facilitate separation of complexed from uncomplexed forms of one or both 
of the polypeptides, as well as to accommodate automation of the assay. Binding of a 
5 test compound to a TLCC-4 or TLCC-5 polypeptide, or interaction of a TLCC-4 or 
TLCC-5 polypeptide with a target molecule in the presence and absence of a candidate 
compound, can be accomplished in any vessel suitable for containing the reactants. 
Examples of such vessels include microtiter plates, test tubes, and micro-centrifuge 
tubes. In one embodiment, a fusion protein can be provided which adds a domain that 

10 allows one or both of the proteins to be bound to a matrix. For example, glutathione-S- 
transferase/ TLCC-4 or TLCC-5 fusion proteins or glutathione-S-transferase/target 
fusion proteins can be adsorbed onto glutathione sepharose beads (Sigma Chemical, St. 
Louis, MO) or glutathione derivatized micrometer plates, which are then combined with 
the test compound or the test compound and either the non-adsorbed target protein or 

15 TLCC-4 or TLCC-5 polypeptide, and the mixture incubated under conditions conducive 
to complex formation (e.g., at physiological conditions for salt and pH). Following 
incubation, the beads or micrometer plate wells are washed to remove any unbound 
components, the matrix immobilized in the case of beads, complex determined either 
directly or indirectly, for example, as described above. Alternatively, the complexes can 

20 be dissociated from the matrix, and the level of TLCC-4 or TLCC-5 binding or activity 
determined using standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the 
screening assays of the invention. For example, either a TLCC-4 or TLCC-5 
polypeptide or a TLCC-4 or TLCC-5 target molecule can be immobilized utilizing 

25 conjugation of biotin and streptavidin. Biotinylated TLCC-4 or TLCC-5 polypeptide or 
target molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) using 
techniques known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, IL), and 
immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical). 
Alternatively, antibodies reactive with TLCC-4 or TLCC-5 polypeptide or target 

30 molecules but which do not interfere with binding of the TLCC-4 or TLCC-5 
polypeptide to its target molecule can be derivatized to the wells of the plate, and 
unbound target or TLCC-4 or TLCC-5 polypeptide trapped in the wells by antibody 
conjugation. Methods for detecting such complexes, in addition to those described 
above for the GST-immobilized complexes, include immunodetection of complexes 

35 using antibodies reactive with the TLCC-4 or TLCC-5 polypeptide or target molecule, 
as well as enzyme-linked assays which rely on detecting an enzymatic activity 
associated with the TLCC-4 or TLCC-5 polypeptide or target molecule. 
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In another embodiment, modulators of TLCC-4 or TLCC-5 expression are 
identified in a method wherein a cell is contacted with a candidate compound and the 
expression of TLCC-4 or TLCC-5 mRNA or polypeptide in the cell is determined. The 
level of expression of TLCC-4 or TLCC-5 mRNA or polypeptide in the presence of the 
5 candidate compound is compared to the level of expression of TLCC-4 or TLCC-5 
mRNA or polypeptide in the absence of the candidate compound. The candidate 
compound can then be identified as a modulator of TLCC-4 or TLCC-5 expression 
based on this comparison. For example, when expression of TLCC-4 or TLCC-5 
mRNA or polypeptide is greater (statistically significantly greater) in the presence of the 

10 candidate compound than in its absence, the candidate compound is identified as a 
stimulator of TLCC-4 or TLCC-5 mRNA or polypeptide expression. Alternatively, 
when expression of TLCC-4 or TLCC-5 mRNA or polypeptide is less (statistically 
significantly less) in the presence of the candidate compound than in its absence, the 
candidate compound is identified as an inhibitor of TLCC-4 or TLCC-5 mRNA or 

15 polypeptide expression. The level of TLCC-4 or TLCC-5 mRNA or polypeptide 
expression in the cells can be determined by methods described herein for detecting 
TLCC-4 or TLCC-5 mRNA or polypeptide. 

In yet another aspect of the invention, the TLCC-4 or TLCC-5 polypeptides can 
be used as "bait proteins" in a two-hybrid assay or three-hybrid assay (see, e.g., U.S. 

20 PatentNo. 5,283,317; Zervos et al (1993) Cell 72:223-232: Madura et al. (1993) J. 

Biol Chem. 268:12046-12054; Bartel et al (1993) Biotechniques 14:920-924; Iwabuchi . 
et al (1993) Oncogene 8:1693-1696; and Brent WO94/10300), to identify other 
proteins, which bind to or interact with TLCC-4 or TLCC-5 ("TLCC-4-binding proteins 
or TLCC-5-binding proteins" or "TLCC-4-bp or TLCC-5-bp") and are involved in 

25 TLCC-4 or TLCC-5 activity. Such TLCC-4-binding proteins or TLCC-5-binding 
proteins are also likely to be involved in the propagation of signals by the TLCC-4 or 
TLCC-5 polypeptides or TLCC-4 or TLCC-5 targets as, for example, downstream 
elements of a TLCC-4-mediated or TLCC-5-mediated signaling pathway. Alternatively, 
such TLCC-4-binding proteins or TLCC-5 -binding proteins are likely to be TLCC-4 or 

30 TLCC-5 inhibitors. 

The two-hybrid system is based on the modular nature of most transcription 
factors, which consist of separable DNA-binding and activation domains. Briefly, the 
assay utilizes two different DNA constructs. In one construct, the gene that codes for a 
TLCC-4 or TLCC-5 polypeptide is fused to a gene encoding the DNA binding domain 

35 of a known transcription factor {e.g. , GAL-4). In the other construct, a DNA sequence, 
from a library of DNA sequences, that encodes an unidentified protein ("prey" or 
"sample") is fused to a gene that codes forthe activation domain of the known 
transcription factor. If the "bait" and the "prey" proteins are able to interact, in vivo, 
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forming a TLCC-4-dependent or TLCC-5-dependent complex, the DNA-binding and 
activation domains of the transcription factor are brought into close proximity. This 
proximity allows transcription of a reporter gene (e.g., LacZ) which is operably linked to 
a transcriptional regulatory site responsive to the transcription factor. Expression of the 
5 reporter gene can be detected and cell colonies containing the functional transcription 
factor can be isolated and used to obtain the cloned gene which encodes the protein 
which interacts with the TLCC-4 or TLCC-5 polypeptide. 

In another aspect, the invention pertains to a combination of two or more of the 
assays described herein. For example, a modulating agent can be identified using a cell- 

10 based or a cell free assay, and the ability of the agent to modulate the activity of a 

TLCC-4 or TLCC-5 polypeptide can be confirmed in vivo, e.g., in an animal such as an 
animal model for cellular transformation and/or tumorigenesis. 

This invention further pertains to novel agents identified by the above-described 
screening assays. Accordingly, it is within the scope of this invention to further use an 

15 agent identified as described herein in an appropriate animal model. For example, an 
agent identified as described herein (e.g., a TLCC-4 or TLCC-5 modulating agent, an 
antisense TLCC-4 or TLCC-5 nucleic acid molecule, a TLCC-4-specific or TLCC-5- 
specific antibody, or a TLCC-4-binding partner or TLCC-5-binding partner) can be used 
in an animal model to determine the efficacy, toxicity, or side effects of treatment with 

20 such an agent. Alternatively, an agent identified as described herein can be used in an , 
animal model to determine the mechanism of action of such an agent. Furthermore, this 
invention pertains to uses of novel agents identified by the above-described screening 
assays for treatments as described herein. 

Models for studying pain in vivo include rat models of neuropathic pain caused 

25 by methods such as intraperitoneal administration of Taxol (Authier et ah (2000) Brain 
Res. 887:239-249), chronic constriction injury (CCI), partial sciatic nerve transection 
(Linenlaub and Sommer (2000) Pain 89:97-106), transection of the tibial and sural 
nerves (Lee et ah (2000) Neuroscl Lett 291:29-32), the spared nerve injury model 
(Decosterd and Woolf (2000) Pain 87:149-158), cuffing the sciatic nerve (Pitcher and 

30 Henry (2000) Eur. J. Neuroscl 12:2006-2020), unilateral tight ligation (Esser and 

Sawynok (2000) Eur. J. Pharmacol. 399:131-139), L5 spinal nerve ligation (Honroe et 
ah (2000) Neuroscl 98:585-598), and photochemically induced ischemic nerve injury 
(Hao et ah (2000) Exp. Neurol 163:231-238); rat models of nociceptive pain caused by 
methods such as the Chung Method, the Bennett Method, and intraperitoneal 

35 administration of Complete Freund's adjuvant (Abdi et ah (2000) Anesth. Analg. 

91 :955-959); rat models of post-incisional pain caused by incising the skin and fascia of 
a hind paw (Olivera and Prado (2000) Braz. J. Med, Biol. Res. 33:957-960); rat models 
of cancer pain caused by methods such as injecting osteolytic sarcoma cells into the 
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femur (Honroe et ah (2000) Neurosci. 98:585-598); and rat models of visceral pain 
caused by methods such as intraperitoneal administration of cyclophosphamide. 

Various methods of determining an animal's response to pain are known in the 
art. Examples of such methods include, but are not limited to brief intense exposure to a 
5 focused heat source, administration of a noxious chemical subcutaneously, the tail flick 
test, the hot plate test, the formalin test, the Von Frey threshold test, and testing for 
stress-induced analgesia (e.g., by restraint, foot shock, and/or cold water swim) 
(Crawley (2000) What's Wrong With My Mouse? Wiley-Liss pp. 72-75). 

10 B. Detection Assays 

Portions or fragments of the cDNA sequences identified herein (and the 
corresponding complete gene sequences) can be used in numerous ways as 
polynucleotide reagents. For example, these sequences can be used to: (i) map their 
respective genes on a chromosome; and, thus, locate gene regions associated with 

15 genetic disease; (ii) identify an individual from a minute biological sample (tissue 

typing); and (iii) aid in forensic identification of a biological sample. These applications 
are described in the subsections below. 

1. Chromosome Mapping 

20 Once the sequence (or a portion of the sequence) of a gene has been isolated, this 

sequence can be used to map the location of the gene on a chromosome. This process is 
called chromosome mapping. Accordingly, portions or fragments of the TLCC-4 or 
TLCC-5 nucleotide sequences, described herein, can be used to map the location of the 
TLCC-4 or TLCC-5 genes on a chromosome. The mapping of the TLCC-4 or TLCC-5 

25 sequences to chromosomes is an important first step in correlating these sequences with 
genes associated with disease. 

Briefly, TLCC-4 or TLCC-5 genes can be mapped to chromosomes by preparing 
PCR primers (preferably 15-25 bp in length) from the TLCC-4 or TLCC-5 nucleotide 
sequences. Computer analysis of the TLCC-4 or TLCC-5 sequences can be used to 

30 predict primers that do not span more than one exon in the genomic DNA, thus 
complicating the amplification process. These primers can then be used for PCR 
screening of somatic cell hybrids containing individual human chromosomes. Only 
those hybrids containing the human gene corresponding to the TLCC-4 or TLCC-5 
sequences will yield an amplified fragment. 

35 Somatic cell hybrids are prepared by fusing somatic cells from different 

mammals (e.g., human and mouse cells). As hybrids of human and mouse cells grow 
and divide, they gradually lose human chromosomes in random order, but retain the 
mouse chromosomes. By using media in which mouse cells cannot grow, because they 
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lack a particular enzyme, but human cells can, the one human chromosome that contains 
the gene encoding the needed enzyme, will be retained. By using various media, panels 
of hybrid cell lines can be established. Each cell line in a panel contains either a single 
human chromosome or a small number of human chromosomes, and a full set of mouse 
5 chromosomes, allowing easy mapping of individual genes to specific human 
chromosomes. (D'Eustachio P. et al. (1983) Science 220:919-924). Somatic cell 
hybrids containing only fragments of human chromosomes can also be produced by 
using human chromosomes with translocations and deletions. 

PCR mapping of somatic cell hybrids is a rapid procedure for assigning a 

10 particular sequence to a particular chromosome. Three or more sequences can be 
assigned per day using a single thermal cycler. Using the TLCC-4 or TLCC-5 
nucleotide sequences to design oligonucleotide primers, sublocalization can be achieved 
with panels of fragments from specific chromosomes. Other mapping strategies which 
can similarly be used to map a|jTLCC-4 or TLCC-5 sequence to its chromosome include 

15 in situ hybridization (described in Fan, Y. et al (1990) Proc. Natl Acad, Set USA, 

87:6223-27), pre-screening with labeled flow-sorted chromosomes, and pre-selection by 
hybridization to chromosome specific cDNA libraries. 

Fluorescence in situ hybridization (FISH) of a DNA sequence to a metaphase 
chromosomal spread can further be used to provide a precise chromosomal location in 

20 one step. Chromosome spreads can be made using cells whose division has been 

blocked in metaphase by a chemical such as colcemid that disrupts the mitotic spindle. 
The chromosomes can be treated briefly with trypsin, and then stained with Giemsa. A 
pattern of light and dark bands develops on each chromosome, so that the chromosomes 
can be identified individually. The FISH technique can be used with a DNA sequence 

25 as short as 500 or 600 bases. However, clones larger than 1,000 bases have a higher 
likelihood of binding to a unique chromosomal location with sufficient signal intensity 
for simple detection. Preferably 1,000 bases, and more preferably 2,000 bases will 
suffice to get good results at a reasonable amount of time. For a review of this 
technique, see Verma et al, Human Chromosomes: A Manual of Basic Techniques 

30 (Pergamon Press, New York 1 988). 

Reagents for chromosome mapping can be used individually to mark a single 
chromosome or a single site on that chromosome, or panels of reagents can be used for 
marking multiple sites and/or multiple chromosomes. Reagents corresponding to 
noncoding regions of the genes actually are preferred for mapping purposes. Coding 

35 sequences are more likely to be conserved within gene families, thus increasing the 
chance of cross hybridizations during chromosomal mapping. 
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Once a sequence has been mapped to a precise chromosomal location, the 
physical position of the sequence on the chromosome can be correlated with genetic 
map data. (Such data are found, for example, in V. McKusick, Mendelian Inheritance in 
Man, available on-line through Johns Hopkins University Welch Medical Library). The 
5 relationship between a gene and a disease, mapped to the same chromosomal region, can 
then be identified through linkage analysis (co-inheritance of physically adjacent genes), 
described in, for example, Egeland, J. et al (1987) Nature, 325:783-787. 

Moreover, differences in the DNA sequences between individuals affected and 
unaffected with a disease associated with the TLCC-4 or TLCC-5 gene, can be 

10 determined. If a mutation is observed in some or all of the affected individuals but not 
in any unaffected individuals, then the mutation is likely to be the causative agent of the 
particular disease. Comparison of affected and unaffected individuals generally 
involves first looking for structural alterations in the chromosomes, such as deletions or 
translocations that are visible from chromosome spreads or detectable using PCR based 

15 on that DNA sequence. Ultimately, complete sequencing of genes from several 

individuals can be performed to confirm the presence of a mutation and to distinguish 
mutations from polymorphisms. 

2. Tissue Typing 

20 The TLCC-4 or TLCC-5 sequences of the present invention can also be used to 

identify individuals from minute biological samples. The United States military, for 
example, is considering the use of restriction fragment length polymorphism (RFLP) for 
identification of its personnel. In this technique, an individual's genomic DNA is 
digested with one or more restriction enzymes, and probed on a Southern blot to yield 

25 unique bands for identification. This method does not suffer from the current limitations 
of "Dbg Tags" which can be lost, switched, or stolen, making positive identification 
difficult. The sequences of the present invention are useful as additional DNA markers 
for RFLP (described in U.S. Patent 5,272,057). 

Furthermore, the sequences of the present invention can be used to provide an 

30 alternative technique which determines the actual base-by-base DNA sequence of 

selected portions of an individual's genome. Thus, the TLCC-4 or TLCC-5 nucleotide 
sequences described herein can be used to prepare two PCR primers from the 5 f and 3' 
ends of the sequences. These primers can then be used to simplify an individual's DNA 
and subsequently sequence it. 

35 Panels of corresponding DNA sequences from individuals, prepared in this 

manner, can provide unique individual identifications, as each individual will have a 
unique set of such DNA sequences due to allelic differences. The sequences of the 
present invention can be used to obtain such identification sequences from individuals 
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and from tissue. The TLCC-4 or TLCC-5 nucleotide sequences of the invention 
uniquely represent portions of the human genome. Allelic variation occurs to some 
degree in the coding regions of these sequences, and to a greater degree in the 
noncoding regions. It is estimated that allelic variation between individual humans 
occurs with a frequency of about once per each 500 bases. Each of the sequences 
described herein can, to some degree, be used as a standard against which DNA from an 
individual can be compared for identification purposes. Because greater numbers of 
polymorphisms occur in the noncoding regions, fewer sequences are necessary to 
differentiate individuals. The noncoding sequences of SEQ ID NO:l or 4 can 
comfortably provide positive individual identification with a panel of perhaps 10 to 
1 5 000 primers which each yield a noncoding amplified sequence of 100 bases. If 
predicted coding sequences, such as those in SEQ ID NO:3 or 6 are used, a more 
appropriate number of primers for positive individual identification would be 500-2,000. 

If a panel of reagents from TLCC-4 or TLCC-5 nucleotide sequences described 
herein is used to generate a unique identification database for an individual, those same 
reagents can later be used to identify tissue from that individual. Using the unique 
identification database, positive identification of the individual, living or dead, can be 
made from extremely small tissue samples. 

20 3. Use of TLCC-4 and TLCC-5 Sequences in Forensic Biology 

DNA-based identification techniques can also be used in forensic biology. 
Forensic biology is a scientific field employing genetic typing of biological evidence 
found at a crime scene as a means for positively identifying, for example, a perpetrator 
of a crime. To make such an identification, PCR technology can be used to amplify 

25 DNA sequences taken from very small biological samples such as tissues, e.g., hair or 
skin, or body fluids, e.g., blood, saliva, or semen found at a crime scene. The amplified 
sequence can then be compared to a standard, thereby allowing identification of the 
origin of the biological sample. 

The sequences of the present invention can be used to provide polynucleotide 

30 reagents, e.g. , PCR primers, targeted to specific loci in the human genome, which can 
enhance the reliability of DNA-based forensic identifications by, for example, providing 
another "identification marker" (i.e. another DNA sequence that is unique to a particular 
individual). As mentioned above, actual base sequence information can be used for 
identification as an accurate alternative to patterns formed by restriction enzyme 

35 generated fragments. Sequences targeted to noncoding regions of SEQ ID NO:l or 4 are 
particularly appropriate for this use as greater numbers of polymorphisms occur in the 
noncoding regions, making it easier to differentiate individuals using this technique. 
Examples of polynucleotide reagents include the TLCC-4 or TLCC-5 nucleotide 
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sequences or portions thereof, e.g., fragments derived from the noncoding regions of 
SEQ ID NO:l or 4 having a length of at least 20 bases, preferably at least 30 bases. 

The TLCC-4 or TLCC-5 nucleotide sequences described herein can further be 
used to provide polynucleotide reagents, e.g., labeled or labelable probes which can be 
5 used in, for example, an in situ hybridization technique, to identify a specific tissue, e.g., 
brain tissue. This can be very useful in cases where a forensic pathologist is presented 
with a tissue of unknown origin. Panels of such TLCC-4 or TLCC-5 probes can be used 
to identify tissue by species and/or by organ type. 

In a similar fashion, these reagents, e.g., TLCC-4 or TLCC-5 primers or probes 
10 can be used to screen tissue culture for contamination {i.e. screen for the presence of a 
mixture of different types of cells in a culture). 

C. Predictive Medicine : 

The present invention also pertains to the field of predictive medicine in which 
15 diagnostic assays, prognostic assays, and monitoring clinical trials are used for 
prognostic (predictive) purposes to thereby treat an individual prophylactically. 
Accordingly, one aspect of the present invention relates to diagnostic assays for 
determining TLCC-4 or TLCC-5 polypeptide and/or nucleic acid expression as well as 
TLCC-4 or TLCC-5 activity, in the context of a biological sample (e.g., blood, serum, 
20 cells, tissue) to thereby determine whether an individual is afflicted with a disease or 
disorder, or is at risk of developing a disorder, associated with aberrant or unwanted 
TLCC-4 or TLCC-5 expression or activity. The invention also provides for prognostic 
(or predictive) assays for determining whether an individual is at risk of developing a 
disorder associated with TLCC-4 or TLCC-5 polypeptide, nucleic acid expression or 
25 activity. For example, mutations in a TLCC-4 or TLCC-5 gene can be assayed in a 
biological sample. Such assays can be used for prognostic or predictive purpose to 
thereby prophylactically treat an individual prior to the onset of a disorder characterized 
by or associated with TLCC-4 or TLCC-5 polypeptide, nucleic acid expression or 
activity. 

30 . Another aspect of the invention pertains to monitoring the influence of agents 

(e.g., drugs, compounds) on the expression or activity of TLCC-4 or TLCC-5 in clinical 
trials. 

These and other agents are described in further detail in the following sections. 

35 1. Diagnostic Assays 

An exemplary method for detecting the presence or absence of TLCC-4 or 
TLCC-5 polypeptide or nucleic acid in a biological sample involves obtaining a 
biological sample from a test subject and contacting the biological sample with a 
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compound or an agent capable of detecting TLCC-4 or TLCC-5 polypeptide or nucleic 
acid (e.g., mRNA, or genomic DNA) that encodes TLCC-4 or TLCC-5 polypeptide such 
that the presence of TLCC-4 or TLCC-5 polypeptide or nucleic acid is detected in the 
biological sample. In another aspect, the present invention provides a method for 
5 detecting the presence of TLCC-4 or TLCC-5 activity in a biological sample by 
contacting the biological sample with an agent capable of detecting an indicator of 
TLCC-4 or TLCC-5 activity such that the presence of TLCC-4 or TLCC-5 activity is 
detected in the biological sample. A preferred agent for detecting TLCC-4 or TLCC-5 
mRNA or genomic DNA is a labeled nucleic acid probe capable of hybridizing to 

10 . TLCC-4 or TLCC-5 mRNA or genomic DNA. The nucleic acid probe can be, for 

example, the TLCC-4 or TLCC-5 nucleic acid set forth in SEQ ID NO:l, 3, 4, or 6, or 

the DNA inserts of the plasmids deposited with ATCC as Accession Numbers , or 

a portion thereof, such as an oligonucleotide of at least 15, 30, 50, 100, 250 or 500 
nucleotides in length and sufficient to specifically hybridize under stringent conditions 

15 to TLCC-4 or TLCC-5 mRNA or genomic DNA. Other suitable probes for use in the 
diagnostic assays of the invention are described herein. 

A preferred agent for detecting TLCC-4 or TLCC-5 polypeptide is an antibody 
capable of binding to TLCC-4 or TLCC-5 polypeptide, preferably an antibody with a 
detectable label. Antibodies can be polyclonal, or more preferably, monoclonal. An 

20 intact antibody, or a fragment thereof (e.g., Fab or F(ab')2) can be used. The term 

"labeled", with regard to the probe or antibody, is intended to encompass direct labeling 
of the probe or antibody by coupling (i.e., physically linking) a detectable substance to 
the probe or antibody, as well as indirect labeling of the probe or antibody by reactivity 
with another reagent that is directly labeled. Examples of indirect labeling include 

25 detection of a primary antibody using a fluorescently labeled secondary antibody and 
end-labeling of a DNA probe with biotin such that it can be detected with fluorescently 
labeled streptavidin. The term "biological sample" is intended to include tissues, cells 
and biological fluids isolated from a subject, as well as tissues, cells and fluids present 
within a subject. That is, the detection method of the invention can be used to detect 

30 TLCC-4 or TLCC-5 mRNA, polypeptide, or genomic DNA in a biological sample in 
vitro as well as in vivo. For example, in vitro techniques for detection of TLCC-4 or 
TLCC-5 mRNA include Northern hybridizations and in situ hybridizations. In vifro 
techniques for detection of TLCC-4 or TLCC-5 polypeptide include enzyme linked 
immunosorbent assays (ELISAs), Western blots, immunoprecipitations and 

35 immunofluorescence. In vitro techniques for detection of TLCC-4 or TLCC-5 genomic 
DNA include Southern hybridizations. Furthermore, in vivo techniques for detection of 
TLCC-4 or TLCC-5 polypeptide include introducing into a subject a labeled anti-TLCC- 
4 or anti-TLCC-5 antibody. For example, the antibody can be labeled with a radioactive 
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marker whose presence and location in a subject can be detected by standard imaging 
techniques. 

The present invention also provides diagnostic assays for identifying the 
presence or absence of a genetic alteration characterized by at least one of (i) aberrant 
5 modification or mutation of a gene encoding a TLCC-4 or TLCC-5 polypeptide; (ii) 
aberrant expression of a gene encoding a TLCC-4 or TLCC-5 polypeptide; (iii) mis- 
regulation of the gene; and (iii) aberrant post-translational modification of a TLCC-4 or 
TLCC-5 polypeptide, wherein a wild-type form of the gene encodes a protein with a 
TLCC-4 or TLCC-5 activity. "Misexpression or aberrant expression", as used herein, 

10 refers to a non-wild type pattern of gene expression, at the RNA or protein level. It 
includes, but is not limited to, expression at non-wild type levels (e.g., over or under 
expression); a pattern of expression that differs from wild type in terms of the time or 
stage at which the gene is expressed (e.g., increased or decreased expression (as 
compared with wild type) at a predetermined developmental period or stage); a pattern 

15 of expression that differs from wild type in terms of decreased expression (as compared 
with wild type) in a predetermined cell type or tissue type; a pattern of expression that 
differs from wild type in terms of the splicing size, amino acid sequence, post- 
transitional modification, or biological activity of the expressed polypeptide; a pattern of 
expression that differs from wild type in terms of the effect of an environmental 

20 stimulus or extracellular stimulus on expression of the gene (e.g. , a pattern of increased 
or decreased expression (as compared with wild type) in the presence of an increase or 
decrease in the strength of the stimulus). 

In one embodiment, the biological sample contains polypeptide molecules from 
the test subject. Alternatively, the biological sample can contain mRNA molecules from 

25 the test subject or genomic DNA molecules from the test subject. A preferred biological 
sample is a serum sample isolated by conventional means from a subject. 

In another embodiment, the methods further involve obtaining a control 
biological sample from a control subject, contacting the control sample with a 
compound or agent capable of detecting TLCC-4 or TLCC-5 polypeptide, mRNA, or 

30 genomic DNA, such that the presence of TLCC-4 or TLCC-5 polypeptide, mRNA or 
genomic DNA is detected in the biological sample, and comparing the presence of 
TLCC-4 or TLCC-5 polypeptide, mRNA or genomic DNA in the control sample with 
the presence of TLCC-4 or TLCC-5 polypeptide, mRNA or genomic DNA in the test 
sample. 

35 The invention also encompasses kits for detecting the presence of TLCC-4 or 

TLCC-5 in a biological sample. For example, the kit can comprise a labeled compound 
or agent capable of detecting TLCC-4 or TLCC-5 polypeptide or mRNA in a biological 
sample; means for determining the amount of TLCC-4 or TLCC-5 in the sample; and 
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means for comparing the amount of TLCC-4 or TLCC-5 in the sample with a standard. 
The compound or agent can be packaged in a suitable container. The kit can further 
comprise instructions for using the kit to detect TLCC-4 or TLCC-5 polypeptide or 
nucleic acid. 

5 

2. Prognostic Assays 

The diagnostic methods described herein can furthermore be utilized to identify 
subjects having or at risk of developing a disease or disorder associated with aberrant or 
unwanted TLCC-4 or TLCC-5 expression or activity. As used herein, the term 
10 "aberrant" includes a TLCC-4 or TLCC-5 expression or activity which deviates from the 
wild type TLCC-4 or TLCC-5 expression or activity. Aberrant expression or activity 
includes increased or decreased expression or activity, as well as expression or activity 
which does not follow the wild type developmental pattern of expression or the 
subcellular pattern of expression. For example, aberrant TLCC-4 or TLCC-5 expression 
15 or activity is intended to include the cases in which a mutation in the TLCC-4 or TLCC- 
5 gene causes the TLCC-4 or TLCC-5 gene to be under-expressed or over-expressed and 
situations in which such mutations result in a non-functional TLCC-4 or TLCC-5 
polypeptide or a polypeptide which does not function in a wild-type fashion, e.g., a 
polypeptide which does not interact with a TLCC-4 or TLCC-5 substrate, e.g., a non- 
20 calcium channel subunit or ligand and/or a non-vanilloid receptor subunit or ligand, or 
one which interacts with a non-TLCC-4 or non-TLCC-5 substrate, e.g. a non-calcium 
channel subunit or ligand and/or a non-vanilloid receptor subunit or ligand. As used 
herein, the term "unwanted" includes an unwanted phenomenon involved in a biological 
response, such as cellular proliferation. For example, the term unwanted includes a 
25 TLCC-4 or TLCC-5 expression or activity which is undesirable in a subject. 

The assays described herein, such as the preceding diagnostic assays or the 
following assays, can be utilized to identify a subject having or at risk of developing a 
disorder associated with a misregulation in TLCC-4 or TLCC-5 polypeptide activity or 
nucleic acid expression, such as a CNS disorder (e.g., a neurodegenerative disorder, a 
30 pain disorder, or a cellular proliferation, growth, differentiation, or migration disorder). 
Alternatively, the prognostic assays can be utilized to identify a subject having or at risk 
for developing a disorder associated with a misregulation in TLCC-4 or TLCC-5 
polypeptide activity or nucleic acid expression, such as a CNS disorder, a pain disorder, 
or a cellular proliferation, growth, differentiation, or migration disorder. Thus, the 
35 present invention provides a method for identifying a disease or disorder associated with 
aberrant or unwanted TLCC-4 or TLCC-5 expression or activity in which a test sample 
is obtained from a subject and TLCC-4 or TLCC-5 polypeptide or nucleic acid (e.g., 
mRNA or genomic DNA) is detected, wherein the presence of TLCC-4 or TLCC-5 
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polypeptide or nucleic acid is diagnostic for a subject having or at risk of developing a 
disease or disorder associated with aberrant or unwanted TLCC-4 or TLCC-5 expression 
or activity. As used herein, a "test sample" refers to a biological sample obtained from a 
subject of interest. For example, a test sample can be a biological fluid (e.g. , serum), 
5 cell sample, or tissue. 

Furthermore, the prognostic assays described herein can be used to determine 
whether a subject can be administered an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate) 
to treat a disease or disorder associated with aberrant or unwanted TLCC-4 or TLCC-5 

10 expression or activity. For example, such methods can be used to determine whether a 
subject can be effectively treated with an agent for a CNS disorder, pain disorder, or a 
cellular proliferation, growth, differentiation, or migration disorder. Thus, the present 
invention provides methods for determining whether a subject can be effectively treated 
with an agent for a disorder associated with aberrant or unwanted TLCC-4 or TLCC-5 

15 expression or activity in which a test sample is obtained and TLCC-4 or TLCC-5 
polypeptide or nucleic acid expression or activity is detected (e.g., wherein the 
abundance of TLCC-4 or TLCC-5 polypeptide or nucleic acid expression or activity is 
diagnostic for a subject that can be administered the agent to treat a disorder associated 
with aberrant or unwanted TLCC-4 or TLCC-5 expression or activity). 

20 The methods of the invention can also be used to detect genetic alterations in a 

TLCC-4 or TLCC-5 gene, thereby determining if a subject with the altered gene is at 
risk for a disorder characterized by misregulation in TLCC-4 or TLCC-5 polypeptide 
activity or nucleic acid expression, such as a CNS disorder, pain disorder, or a disorder 
of cellular growth, differentiation, or migration. In preferred embodiments, the methods 

25 include detecting, in a sample of cells from the subject, the presence or absence of a 

genetic alteration characterized by at least one of an alteration affecting the integrity of a 
gene encoding a TLCC-4 or TLCC-5 - polypeptide, or the mis-expression of the TLCC- 
4 or TLCC-5 gene. For example, such geiietic alterations can be detected by 
ascertaining the existence of at least one of the following: 1) a deletion of one or more 

30 nucleotides from a TLCC-4 or TLCC-5 gene; 2) an addition of one or more nucleotides 
to a TLCC-4 or TLCC-5 gene; 3) a substitution of one or more nucleotides of a TLCC-4 
or TLCC-5 gene; 4) a chromosomal rearrangement of a TLCC-4 or TLCC-5 gene; 5) an 
alteration in the level of a messenger RNA transcript of a TLCC-4 or TLCC-5 gene; 6) 
aberrant modification of a TLCC-4 or TLCC-5 gene, such as of the methylation pattern 

35 of the genomic DNA; 7) the presence of a non-wild type splicing pattern of a messenger 
RNA transcript of a TLCC-4 or TLCC-5 gene; 8) a non-wild type level of a TLCC-4 or 
TLCC-5 polypeptide; 9) allelic loss of a TLCC-4 or TLCC-5 gene; and 10) 
inappropriate post-translational modification of a TLCC-4 or TLCC-5 polypeptide. As 
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described herein, there are a large number of assays known in the art which can be used 
for detecting alterations in a TLCC-4 or TLCC-5 gene. A preferred biological sample is 
a tissue or serum sample isolated by conventional means from a subject. 

In certain embodiments, detection of the alteration involves the use of a 
5 probe/primer in a polymerase chain reaction (PCR) (see, e.g. , U.S. Patent Nos. 

4,683,195 and 4,683,202), such as anchor PCR or RACE PCR, or, alternatively, in a 
ligation chain reaction (LCR) (see, e.g., Landegran et al (1988) Science 241 .1077-1080; 
andNakazawa et al (1994) Proc. Natl. Acad. Set USA 91:360-364), the latter of which 
can be particularly useful for detecting point mutations in the TLCC-4 or TLCC-5 gene 

10 (see, e.g., Abravaya et al. (1995) Nucleic Acids Res .23:675-682). This method can 
include the steps of collecting a sample of cells from a subject, isolating nucleic acid 
{e.g., genomic, mRNA or both) from the cells of the sample, contacting the nucleic acid 
sample with one or more primers which specifically hybridize to a TLCC-4 or TLCC-5 
gene under conditions such that hybridization and amplification of the TLCC-4 or 

15 TLCC-5 gene (if present) occurs, and detecting the presence or absence of an 

amplification product, or detecting the size of the amplification product and comparing 
the length to a control sample. It is anticipated that PCR and/or LCR may be desirable 
to use as a preliminary amplification step in conjunction with any of the techniques used 
for detecting mutations described herein. 

20 Alternative amplification methods include: self sustained sequence replication 

(Guatelli, J.C. et al, (1990) Proc. Natl Acad. Sci. USA 87:1874-1878), transcriptional 
amplification system (Kwoh, D.Y. et al, (1989) Proc. Natl Acad. Sci. USA 86:1 173- 
1 177), Q-Beta Replicase (Lizardi, P.M. et al. (1988) Bio-Technology 6:1 197), or any 
other nucleic acid amplification method, followed by the detection of the amplified 

25 molecules using techniques well known to those of skill in the art. These detection 
schemes are especially useful for the detection of nucleic acid molecules if such 
molecules are present in very low numbers. 

In an alternative embodiment, mutations in a TLCC-4 or TLCC-5 gene from a 
sample cell can be identified by alterations in restriction enzyme cleavage patterns. For 

30 example, sample and control DNA is isolated, amplified (optionally), digested with one 
or more restriction endonucleases, and fragment length sizes are determined by gel 
electrophoresis and compared. Differences in fragment length sizes between sample and 
control DNA indicates mutations in the sample DNA. Moreover, the use of sequence 
specific ribozymes (see, for example, U.S. Patent No. 5,498,531) can be used to score 

35 for the presence of specific mutations by development or loss of a ribozyme cleavage 
site. 

In other embodiments, genetic mutations in TLCC-4 or TLCC-5 can be 
identified by hybridizing a sample and control nucleic acids, e.g., DNA or RNA, to high 
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density arrays containing hundreds or thousands of oligonucleotides probes (Cronin, 
M.T. et al (1996) Human Mutation 7: 244-255; Kozal, M.J. et al. (1996) Nature 
Medicine 2: 753-759). For example, genetic mutations in TLCC-4 or TLCC-5 can be 
identified in two dimensional arrays containing light-generated DNA probes as 
5 described in Cronin, M.T. et al. supra. Briefly, a first hybridization array of probes can 
be used to scan through long stretches of DNA in a sample and control to identify base 
changes between the sequences by making linear arrays of sequential overlapping 
probes. This step allows the identification of point mutations. This step is followed by 
a second hybridization array that allows the characterization of specific mutations by 

10 using smaller, specialized probe arrays complementary to all variants or mutations 

detected. Each mutation array is composed of parallel probe sets, one complementary to 
the wild-type gene and the other complementary to the mutant gene. 

In yet another embodiment, any of a variety of sequencing reactions known in 
the art can be used to directly sequence the TLCC-4 or TLCC-5 gene and detect 

1 5 mutations by comparing the sequence of the sample TLCC-4 or TLCC-5 with the 

corresponding wild-type (control) sequence. Examples of sequencing reactions include 
those based on techniques developed by Maxam and Gilbert ((1977) Proc. Natl. Acad. 
Sci. USA 74:560) or Sanger ((1977) Proc. Natl. Acad. Sci. USA 74:5463). It is also 
contemplated that any of a variety of automated sequencing procedures can be utilized 

20 when performing the diagnostic assays ((1995) Biotechniques 19:448), including 
sequencing by mass spectrometry (see, e.g., PCT International Publication No. WO 
94/16101; Cohen et al. (\996)Adv. Chromatogr. 36:127-162; and Griffin et al (1993) 
Appl Biochem. Biotechnol. 38:147-159). 

Other methods for detecting mutations in the TLCC-4 or TLCC-5 gene include 

25 methods in which protection from cleavage agents is used to detect mismatched bases in 
RNA/RNA or RNA/DNA heteroduplexes (Myers et al. (1985) Science 230:1242). In 
general, the art technique of "mismatch cleavage' 5 starts by providing heteroduplexes of 
formed by hybridizing (labeled) RNA or DNA containing the wild-type TLCC-4 or 
TLCC-5 sequence with potentially mutant RNA or DNA obtained from a tissue sample. 

30 The double-stranded duplexes are treated with an agent which cleaves single-stranded 
regions of the duplex such as which will exist due to basepair mismatches between the 
control and sample strands. For instance, RNA/DNA duplexes can be treated with 
RNase and DNA/DNA hybrids treated with SI nuclease to enzymatically digesting the 
mismatched regions. In other embodiments, either DNA/DNA or RNA/DNA duplexes 

35 can be treated with hydroxylamine or osmium tetroxide and with piperidine in order to 
digest mismatched regions. After digestion of the mismatched regions, the resulting 
material is then separated by size on denaturing polyacrylamide gels to determine the 
site of mutationj(see, for example, Cotton et al. (1988) Proc. Natl Acad Sci USA 
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85:4397; Saleeba et al (1992) Methods Enzymol 217:286-295). In a preferred 
embodiment, the control DNA or RN A can be labeled for detection. 

In still another embodiment, the mismatch cleavage reaction employs one or 
more proteins that recognize mismatched base pairs in double-stranded DNA (so called 
5 "DNA mismatch repair" enzymes) in defined systems for detecting and mapping point 
mutations in TLCC-4 or TLCC-5 cDNAs obtained from samples of cells. For example, 
the mutY enzyme of E. coli cleaves A at G/A mismatches and the thymidine DNA 
glycosylase from HeLa cells cleaves T at G/T mismatches (Hsu et al. (1994) 
Carcinogenesis 15:1657-1662). According to an exemplary embodiment, a probe based 

10 on a TLCC-4 or TLCC-5 sequence, e.g., a wild-type TLCC-4 or TLCC-5 sequence, is 
hybridized to a cDNA or other DNA product from a test cell(s). The duplex is treated 
with a DNA mismatch repair enzyme, and the cleavage products, if any,' can be detected 
from electrophoresis protocols or the like (see, for example, U.S. Patent No. 5,459,039). 
In other embodiments, alterations in electrophoretic mobility will be used to 

15 identify mutations in TLCC-4 or TLCC-5 genes. For example, single strand 
conformation polymorphism (SSCP) may be used to detect differences in 
electrophoretic mobility between mutant and wild type nucleic acids (Orita et al (1989) 
Proc Natl. Acad. Sci USA: 86:2766, see also Cotton (1993) Mutat Res. 285:125-144; 
and Hayashi (1992) Genet. Anal. Tech. Appl. 9:73-79). Single-stranded DNA fragments 

20 of sample and control TLCC-4 or TLCC-5 nucleic acids will be denatured and allowed 
to renature. The secondary structure of single-stranded nucleic acids varies according to 
sequence, the resulting alteration in electrophoretic mobility enables the detection of 
even a single base change. The DNA fragments may be labeled or detected with labeled 
probes. The sensitivity of the assay may be enhanced by using RNA (rather than DNA), 

25 in which the secondary structure is more sensitive to a change in sequence. In a 

preferred embodiment, the subject method utilizes heteroduplex analysis to separate 
double stranded heteroduplex molecules on the basis of changes in electrophoretic 
mobility (Keen et al (1991) Trends Genet 7:5). 

In yet another embodiment the movement of mutant or wild-type fragments in 

30 poly aery lamide gels containing a gradient of denaturant is assayed using denaturing 
gradient gel electrophoresis (DGGE) (Myers et al. (1985) Nature 313:495). When 
DGGE is used as the method of analysis, DNA will be modified to insure that it does not 
completely denature, for example by adding a GC clamp of approximately 40 bp of 
high-melting GC-rich DNA by PCR. In a further embodiment, a temperature gradient is 

35 used in place of a denaturing gradient to identify differences in the mobility of control 
and sample DNA (Rosenbaum and Reissner (1987) Biophys Chem 265:12753). 

Examples of other techniques for detecting point mutations include, but are not 
limited to, selective oligonucleotide hybridization, selective amplification, or selective 
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primer extension. For example, oligonucleotide primers may be prepared in which the 
known mutation is placed centrally and then hybridized to target DNA under conditions 
which permit hybridization only if a perfect match is found (Saiki et al (1986) Nature 
324:163); Saiki et al (1989) Proc. Natl Acad. Sci USA 86:6230). Such allele specific 
5 oligonucleotides are hybridized to PCR amplified target DNA or a number of different 
mutations when the oligonucleotides are attached to the hybridizing membrane and 
hybridized with labeled target DNA. 

Alternatively, allele specific amplification technology which depends on selective 
PCR amplification may be used in conjunction with the instant invention. 

10 Oligonucleotides used as primers for specific amplification may carry the mutation of 
interest in the center of the molecule (so that amplification depends on differential 
hybridization) (Gibbs et al. (1989) Nucleic Acids Res. 17:2437-2448) or at the extreme 3' 
end of one primer where, under appropriate conditions, mismatch can prevent, or reduce 
polymerase extension (Prossner (1993) Tibtech 1 1 :238). In addition it may be desirable 

15 to introduce a novel restriction site in the region of the mutation to create cleavage-based 
detection (Gasparini et al. (1992) Mol. Cell Probes 6:1). It is anticipated that in certain 
embodiments amplification may also be performed using Taq ligase for amplification 
(Barany (1991) Proc. Natl Acad. Sci USA 88:189). In such cases, ligation will occur 
only if there is a perfect match at the 3 ! end of the 5' sequence making it possible to detect 

20 the presence of a known mutation at a specific site by looking for the presence or absence 
of amplification. 

The methods described herein may be performed, for example, by utilizing pre- 
packaged diagnostic kits comprising at least one probe nucleic acid or antibody reagent 
described herein, which may be conveniently used, e.g., in clinical settings to diagnose 
25 patients exhibiting symptoms or family history of a disease or illness involving a TLCC- 
4 or TLCC-5 gene. 

Furthermore, any cell type or tissue in which TLCC-4 or TLCC-5 is expressed 
may be utilized in the prognostic assays described herein. 

30 3. Monitoring of Effects During Clinical Trials 

Monitoring the influence of agents (e.g., drugs) on the expression or activity of a 
TLCC-4 or TLCC-5 polypeptide (e.g., the modulation of membrane excitability) can be 
applied not only in basic drug screening, but also in clinical trials. For example, the 
effectiveness of an agent determined by a screening assay as described herein to increase 

35 TLCC-4 or TLCC-5 gene expression, polypeptide levels, or upregulate TLCC-4 or 
TLCC-5 activity, can be monitored in clinical trials of subjects exhibiting decreased 
TLCC-4 or TLCC-5 gene expression, polypeptide levels, or downregulated TLCC-4 or 
TLCC-5 activity. Alternatively, the effectiveness of an agent determined by a screening 
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assay to decrease TLCC-4 or TLCC-5 gene expression, polypeptide levels, or 
downregulate TLCC-4 or TLCC-5 activity, can be monitored in clinical trials of subjects 
exhibiting increased TLCC-4 or TLCC-5 gene expression, polypeptide levels, or 
upregulated TLCC-4 or TLCC-5 activity. In such clinical trials, the expression or 
5 activity of a TLCC-4 or TLCC-5 gene, and preferably, other genes that have been 
implicated in, for example, a TLCC-4-associated disorder or TLCC-5-associated 
disorder can be used as a "read out" or markers of the phenotype of a particular cell. 

For example, and not by way of limitation, genes, including TLCC-4 or TLCC-5, 
that are modulated in cells by treatment with an agent (e.g., compound, drug or small 

10 molecule) which modulates TLCC-4 or TLCC-5 activity (e.g., identified in a screening 
assay as described herein) can be identified. Thus, to study the effect of agents on 
TLCC-4-associated disorders or TLCC-5 -associated disorders (e.g., disorders 
characterized by deregulated signaling or membrane excitation), for example, in a 
clinical trial, cells can be isolated and RNA prepared and analyzed for the levels of 

15 expression of TLCC-4 or TLCC-5 and other genes implicated in the TLCC-4-associated 
disorder or TLCC-5-associated disorder, respectively. The levels of gene expression 
(e g., a gene expression pattern) can be quantified by northern blot analysis or RT-PCR, 
as described herein, or alternatively by measuring the amount of polypeptide produced, 
by one of the methods as described herein, or by measuring the levels of activity of 

20 TLCC-4 or TLCC-5 or other genes. In this way, the gene expression pattern can serve 
as a marker, indicative of the physiological response of the cells to the agent. 
Accordingly, this response state may be determined before, and at various points during 
treatment of the individual with the agent. 

In a preferred embodiment, the present invention provides a method for 

25 monitoring the effectiveness of treatment of a subject with an agent (e.g., an agonist, 
antagonist, peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug 
candidate identified by the screening assays described herein) including the steps of (i) 
obtaining a pre-administration sample from a subject prior to administration of the 
agent; (ii) detecting the level of expression of a TLCC-4 or TLCC-5 polypeptide, 

30 mRNA, or genomic DNA in the preadministration sample; (iii) obtaining one or more 
post-administration samples from the subject; (iv) detecting the level of expression or 
activity of the TLCC-4 or TLCC-5 polypeptide, mRNA, or genomic DNA in the post- 
administration samples; (v) comparing the level of expression or activity of the TLCC-4 
or TLCC-5 polypeptide, mRNA, or genomic DNA in the pre-administration sample with 

35 the TLCC-4 or TLCC-5 polypeptide, mRNA, or genomic DNA in the post 

administration sample or samples; and (vi) altering the administration of the agent to the 
subject accordingly. For example, increased administration of the agent may be 
desirable to increase the expression or activity of TLCC-4 or TLCC-5 to higher levels 
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than detected, i.e., to increase the effectiveness of the agent. Alternatively, decreased 
administration of the agent may be desirable to decrease expression or activity of TLCC- 
4 or TLCC-5 to lower levels than detected, i.e. to decrease the effectiveness of the agent. 
According to such an embodiment, TLCC-4 or TLCC-5 expression or activity may be 
5 used as an indicator of the effectiveness of an agent, even in the absence of an 
observable phenotypic response. 

4. Electronic Apparatus Readable Media and Arrays 

Electronic apparatus readable media comprising TLCC-4 or TLCC-5 sequence 

10 information is also provided. As used herein, "TLCC-4 or TLCC-5 sequence 

information" refers to any nucleotide and/or amino acid sequence information particular 
to the TLCC-4 or TLCC-5 molecules of the present invention, including but not limited 
to full-length nucleotide and/or amino acid sequences, partial nucleotide and/or amino 
acid sequences, polymorphic sequences including single nucleotide polymorphisms 

15 (SNPs), epitope sequences, and the like. Moreover, information "related to" said 

TLCC-4 or TLCC-5 sequence information includes detection of the presence or absence 
of a sequence (e.g., detection of expression of a sequence, fragment, polymorphism, 
etc.), determination of the level of a sequence (e.g. , detection of a level of expression, 
for example, a quantitative detection), detection of a reactivity to a sequence (e.g., 

20 detection of protein expression and/or levels, for example, using a sequence-specific 
antibody), and the like. As used herein, "electronic apparatus readable media" refers to 
any suitable medium for storing, holding or containing data or information that can be 
read and accessed directly by an electronic apparatus. Such media can include, but are 
not limited to: magnetic storage media, such as floppy discs, hard disc storage medium, 

25 and magnetic tape; optical storage media such as compact disc; electronic storage media 
such as RAM, ROM, EPROM, EEPROM and the like; general hard disks and hybrids of 
these categories such as magnetic/optical storage media. The medium is adapted or 
configured for having recorded thereon TLCC-4 or TLCC-5 sequence information of the 
present invention. 

30 As used herein, the term "electronic apparatus" is intended to include any 

suitable computing or processing apparatus or other device configured or adapted for 
storing data or information. Examples of electronic apparatus suitable for use with the 
present invention include stand-alone computing apparatus; networks, including a local 
area network (LAN), a wide area network (WAN) Internet, Intranet, and Extranet; 

35 electronic appliances such as a personal digital assistants (PDAs), cellular phone, pager 
and the like; and local and distributed processing systems. 

As used herein, "recorded" refers to a process for storing or encoding 
information on the electronic apparatus readable medium. Those skilled in the art can 
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readily adopt any of the presently known methods for recording information on known 
media to generate manufactures comprising the TLCC-4 or TLCC-5 sequence 
information. 

A variety of software programs and formats can be used to store the sequence 
5 information on the electronic apparatus readable medium. For example, the sequence 
information can be represented in a word processing text file, formatted in 
commercially-available software such as WordPerfect and MicroSoft Word, or 
represented in the form of an ASCII file, stored in a database application, such as DB2, 
Sybase, Oracle, or the like, as well as in other forms. Any number of dataprocessor 

10 structuring formats (e.g., text file or database) may be employed in order to obtain or 
create a medium having recorded thereon the TLCC-4 or TLCC-5 sequence information. 

By providing TLCC-4 or TLCC-5 sequence information in readable form, one 
can routinely access the sequence information for a variety of purposes. For example, 
one skilled in the art can use the sequence information in readable form to compare a 

15 target sequence or target structural motif with the sequence information stored within 
the data storage means. Search means are used to identify fragments or regions of the 
sequences of the invention which match a particular target sequence or target motif. 

The present invention therefore provides a medium for holding instructions for 
performing a method for determining whether a subject has a TLCC-4 or TLCC-5- 

20 associated disease or disorder or a pre-disposition to a TLCC-4 or TLCC-5-associated 
disease or disorder, wherein the method comprises the steps of determining TLCC-4 or 
TLCC-5 sequence information associated with the subject and based on the TLCC-4 or 
TLCC-5 sequence information, determining whether the subject has a TLCC-4 or 
TLCC-5-associated disease or disorder or a pre-disposition to a TLCC-4 or TLCC-5 - 

25 associated disease or disorder and/or recommending a particular treatment for the 
disease, disorder or pre-disease condition. 

The present invention further provides in an electronic system and/or in a 
network, a method for determining whether a subject has a TLCC-4 or TLCC-5- 
associated disease or disorder or a pre-disposition to a disease associated with a TLCC-4 

30 or TLCC-5 wherein the method comprises the steps of determining TLCC-4 or TLCC-5 
sequence information associated with the subject, and based on the TLCC-4 or TLCC-5 
sequence information, determining whether the subject has a TLCC-4 or TLCC-5- 
associated disease or disorder or a pre-disposition to a TLCC-4 or TLCC-5-associated 
disease or disorder, and/or recommending a particular treatment for the disease, disorder 

35 or pre-disease condition. The method may further comprise the step of receiving 
phenotypic information associated with the subject and/or acquiring from a network 
phenotypic information associated with the subject. 



76 



NSDOCID: <WO 0200722A2_I_> 



WO 02/00722 



PCT/USOI/20640 



The present invention also provides in a network, a method for determining 
whether a subject has a TLCC-4 or TLCC-5 -associated disease or disorder or a pre- 
disposition to a TLCC-4 or TLCC-5-associated disease or disorder associated with 
TLCC-4 or TLCC-5, said method comprising the steps of receiving TLCC-4 or TLCC-5 
5 sequence information from the subject and/or information related thereto, receiving 
phenotypic information associated with the subject, acquiring information from the 
network corresponding to TLCC-4 or TLCC-5 and/or a TLCC-4 or TLCC-5-associated 
disease or disorder, and based on one or more of the phenotypic information, the TLCC- 
4 or TLCC-5 information (e.g., sequence information and/or information related 

10 thereto), and the acquired information, determining whether the subject has a TLCC-4 or 
TLCC-5-associated disease or disorder or a pre-disposition to a TLCC-4 or TLCC-5- 
associated disease or disorder. The method may further comprise the step of 
recommending a particular treatment for the disease, disorder or pre-disease condition. 

The present invention also provides a business method for determining whether a 

15 subject has a TLCC-4 or TLCC-5-associated disease or disorder or a pre-disposition to a 
TLCC-4 or TLCC-5-associated disease or disorder, said method comprising the steps of 
receiving information related to TLCC-4 or TLCC-5 (e.g., sequence information and/or 
information related thereto), receiving phenotypic information associated with the 
subject, acquiring information from the network related to TLCC-4 or TLCC-5 and/or 

20 related to a TLCC-4 or TLCC-5-associated disease or disorder, and based on one or 
more of the phenotypic information, the TLCC-4 or TLCC-5 information, and the 
acquired information, determining whether the subject has a TLCC-4 or TLCC-5- 
associated disease or disorder or a pre-disposition to a TLCC-4 or TLCC-5-associated 
disease or disorder. The method may further comprise the step of recommending a 

25 particular treatment for the disease, disorder or pre-disease condition. 

The invention also includes an array comprising a TLCC-4 or TLCC-5 sequence 
of the present invention. The array can be used to assay expression of one or more 
genes in the array. In one embodiment, the array can be used to assay gene expression 
in a tissue to ascertain tissue specificity of genes in the array. In this manner, up to 

30 about 7600 genes can be simultaneously assayed for expression, one of which can be 
TLCC-4 or TLCC-5. This allows a profile to be developed showing a battery of genes 
specifically expressed in one or more tissues. 

In addition to such qualitative determination, the invention allows the 
quantitation of gene expression. Thus, not only tissue specificity, but also the level of 

35 expression of a battery of genes in the tissue is ascertainable. Thus, genes can be 
grouped on the basis of their tissue expression per se and level of expression in that 
tissue. This is useful, for example, in ascertaining the relationship of gene expression 
between or among tissues. Thus, one tissue can be perturbed and the effect on gene 



77 



WO 02/00722 



PCT/USO 1/20640 



expression in a second tissue can be determined. In this context, the effect of one cell 
type on another cell type in response to a biological stimulus can be determined. Such a 
determination is useful, for example, to know the effect of cell-cell interaction at the 
level of gene expression. If an agent is administered therapeutically to treat one cell 
5 type but has an undesirable effect on another cell type, the invention provides an assay 
to determine the molecular basis of the undesirable effect and thus provides the 
opportunity to co-administer a counteracting agent or otherwise treat the undesired 
effect. Similarly, even within a single cell type, undesirable biological effects can be 
determined at the molecular level. Thus, the effects of an agent on expression of other 
10 than the target gene can be ascertained and counteracted. 

In another embodiment, the array can be used to monitor the time course of 
expression of one or more genes in the array. This can occur in various biological 
contexts, as disclosed herein, for example development of a TLCC-4 or TLCC-5- 
associated disease or disorder, progression of TLCC-4 or TLCC-5-associated disease or 
15 disorder, and processes, such a cellular transformation associated with the TLCC-4 or 
TLCC-5-associated disease or disorder. 

The array is also useful for ascertaining the effect of the expression of a gene on 
the expression of other genes in the same cell or in different cells (e.g., ascertaining the 
effect of TLCC-4 or TLCC-5 expression on the expression of other genes). This 
20 provides, for example, for a selection of alternate molecular targets for therapeutic 
intervention if the ultimate or downstream target cannot be regulated. 

The array is also useful for ascertaining differential expression patterns of one or 
more genes in normal and abnormal cells. This provides a battery of genes (e.g., 
including TLCC-4 or TLCC-5) that could serve as a molecular target for diagnosis or 
25 therapeutic intervention. 

D. Methods of Treatment : 

The present invention provides for both prophylactic and therapeutic methods of 
treating a subject at risk of (or susceptible to) a disorder or having a disorder associated 

30 with aberrant or unwanted TLCC-4 or TLCC-5 expression or activity, e.g. a CNS 

disorder, pain disorder, or a cellular proliferation, growth, differentiation, or migration 
disorder. With regards to both prophylactic and therapeutic methods of treatment, such 
treatments may be specifically tailored or modified, based on knowledge obtained from 
the field of pharmacogenomics. "Pharmacogenomics", as used herein, refers to the 

35 application of genomics technologies such as gene sequencing, statistical genetics, and 
gene expression analysis to drugs in clinical development and on the market. More 
specifically, the term refers the study of how a patient's genes determine his or her 
response to a drug (e.g., a patient's "drug response phenotype", or "drug response 
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genotype"). Thus, another aspect of the invention provides methods for tailoring an 
individual's prophylactic or therapeutic treatment with either the TLCC-4 or TLCC-5 
molecules of the present invention or TLCC-4 or TLCC-5 modulators according to that 
individual's drug response genotype. Pharmacogenomics allows a clinician or physician 
5 to target prophylactic or therapeutic treatments to patients who will most benefit from 
the treatment and to avoid treatment of patients who will experience toxic drug-related 
side effects. 

Treatment is defined as the application or administration of a therapeutic agent to 
a patient, or application or administration of a therapeutic agent to an isolated tissue or 
10 cell line from a patient, who has a disease, a symptom of disease or a predisposition 
toward a disease, with the purpose to cure, heal, alleviate, relieve, alter, remedy, 
ameliorate, improve or affect the disease, the symptoms of disease or the predisposition 
toward disease. 

A therapeutic agent includes, but is not limited to, small molecules, peptides, 
15 antibodies, ribozymes and antisense oligonucleotides. 

1. Prophylactic Methods 

In one aspect, the invention provides a method for preventing in a subject, a 
disease or condition associated with an aberrant or unwanted TLCC-4 or TLCC-5 

20 expression or activity, by administering to the subject a TLCC-4 or TLCC-5 or an agent 
which modulates TLCC-4 or TLCC-5 expression or at least one TLCC-4 or TLCC-5 
activity. Subjects at risk for a disease which is caused or contributed to by aberrant or 
unwanted TLCC-4 or TLCC-5 expression or activity can be identified by, for example, 
any or a combination of diagnostic or prognostic assays as described herein. 

25 Administration of a prophylactic agent can occur prior to the manifestation of symptoms 
characteristic of the TLCC-4 or TLCC-5 aberrancy, such that a disease or disorder is 
prevented or, alternatively, delayed in its progression. Depending on the type of TLCC- 
4 or TLCC-5 aberrancy, for example, a TLCC-4 or TLCC-5 agonist, or TLCC-4 or 
TLCC-5 antagonist agent can be used for treating the subject. The appropriate agent can 

30 be determined based on screening assays described herein. 

2. Therapeutic Methods 

Another aspect of the invention pertains to methods of modulating TLCC-4 or 
TLCC-5 expression or activity for therapeutic purposes. Accordingly, in an exemplary 
35 embodiment, the modulatory method of the invention involves contacting a cell capable 
of expressing TLCC-4 or TLCC-5 with an agent that modulates one or more of the 
activities of TLCC-4 or TLCC-5 polypeptide activity associated with the cell, such that 
TLCC-4 or TLCC-5 activity in the cell is modulated. An agent that modulates TLCC-4 
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or TLCC-5 polypeptide activity can be an agent as described herein, such as a nucleic 
acid or a polypeptide, a naturally-occurring target molecule of a TLCC-4 or TLCC-5 
polypeptide {e.g., a TLCC-4 or TLCC-5 substrate), a TLCC-4 or TLCC-5 antibody, a 
TLCC-4 or TLCC-5 agonist or antagonist, a peptidomimetic of a TLCC-4 or TLCC-5 
5 agonist or antagonist, or other small molecule. In one embodiment, the agent stimulates 
one or more TLCC-4 or TLCC-5 activities. Examples of such stimulatory agents 
include active TLCC-4 or TLCC-5 polypeptide and a nucleic acid molecule encoding 
TLCC-4 or TLCC-5 that has been introduced into the cell. In another embodiment, the 
agent inhibits one or more TLCC-4 or TLCC-5 activities. Examples of such inhibitory 

10 agents include antisense TLCC-4 or TLCC-5 nucleic acid molecules, anti-TLCC-4 or 
anti-TLCC-5 antibodies, and TLCC-4 or TLCC-5 inhibitors. These modulatory 
methods can be performed in vifro (e.g., by culturing the cell with the agent) or, 
alternatively, in vivo (e.g., by administering the agent to a subject). As such, the present 
invention provides methods of treating an individual afflicted with a disease or disorder 

15 characterized by aberrant or unwanted expression or activity of a TLCC-4 or TLCC-5 
polypeptide or nucleic acid molecule. In one embodiment, the method involves 
administering an agent (e.g., an agent identified by a screening assay described herein), 
or combination of agents that modulates (e.g., upregulates or downregulates) TLCC-4 or 
TLCC-5 expression or activity. In another embodiment, the method involves 

20 administering a TLCC-4 or TLCC-5 polypeptide or nucleic acid molecule as therapy to 
compensate for reduced, aberrant, or unwanted TLCC-4 or TLCC-5 expression or 
activity. 

Stimulation of TLCC-4 or TLCC-5 activity is desirable in situations in which 
TLCC-4 or TLCC-5 is abnormally downregulated and/or in which increased TLCC-4 or 
25 TLCC-5 activity is likely to have a beneficial effect. Likewise, inhibition of TLCC-4 or 
TLCC-5 activity is desirable in situations in which TLCC-4 or TLCC-5 is abnormally 
upregulated and/or in which decreased TLCC-4 or TLCC-5 activity is likely to have a 
beneficial effect. 

30 3. Pharmacogenomics 

The TLCC-4 or TLCC-5 molecules of the present invention, as well as agents, or 
modulators which have a stimulatory or inhibitory effect on TLCC-4 or TLCC-5 activity 
(e.g., TLCC-4 or TLCC-5 gene expression) as identified by a screening assay described 
herein can be administered to individuals to treat (prophylactically or therapeutically) 

35 TLCC-4-associated disorders or TLCC-5-associated disorders (e.g., proliferative 
disorders) associated with aberrant or unwanted TLCC-4 or TLCC-5 activity. In 
conjunction with such treatment, pharmacogenomics (i.e., the study of the relationship 
between an individual's genotype and that individual's response to a foreign compound 
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or drug) may be considered. Differences in metabolism of therapeutics can lead to 
severe toxicity or therapeutic failure by altering the relation between dose and blood 
concentration of the pharmacologically active drug. Thus, a physician or clinician may 
consider applying knowledge obtained in relevant pharmacogenomics studies in 
5 determining whether to administer a TLCC-4 or TLCC-5 molecule or TLCC-4 or 
TLCC-5 modulator as well as tailoring the dosage and/or therapeutic regimen of 
treatment with a TLCC-4 or TLCC-5 molecule or TLCC-4 or TLCC-5 modulator. 

Pharmacogenomics deals with clinically significant hereditary variations in the 
response to drugs due to altered drug disposition and abnormal action in affected 

10 persons. See, for example, Eichelbaum, M. et ah (1996) Clin. Exp. Pharmacol. Physiol. 
23(10-1 1): 983-985 and Linder, M.W. et ah (1997) Clin. Cheim 43(2):254-266. In 
general, two types of pharmacogenetic conditions can be differentiated. Genetic 
conditions transmitted as a single factor altering the way drugs act on the body (altered 
drug action) or genetic conditions transmitted as single factors altering the way the body 

15 acts on drugs (altered drug metabolism). These pharmacogenetic conditions can occur 
either as rare genetic defects or as naturally-occurring polymorphisms. For example, 
glucose-6-phosphate dehydrogenase deficiency (G6PD) is a common inherited 
enzymopathy in which the main clinical complication is haemolysis after ingestion of 
oxidant drugs (anti-malarials, sulfonamides, analgesics, nitrofurans) and consumption of 

20 fava beans. 

One pharmacogenomics approach to identifying genes that predict drug 
response, known as "a genome-wide association 1 ', relies primarily on a high-resolution 
map of the human genome consisting of already known gene-related markers (e.g., a 
"bi-allelic" gene marker map which consists of 60,000-100,000 polymorphic or variable 

25 sites on the human genome, each of which has two variants.) Such a high-resolution 
genetic map can be compared to a map of the genome of each of a statistically 
significant number of patients taking part in a Phase II/III drug trial to identify markers 
associated with a particular observed drug response or side effect. Alternatively, such a 
high resolution map can be generated from a combination of some ten-million known 

30 single nucleotide polymorphisms (SNPs) in the human genome. As used herein, a 
"SNP" is a common alteration that occurs in a single nucleotide base in a stretch of 
DNA. For example, a SNP may occur once per every 1000 bases of DNA. A SNP may 
be involved in a disease process, however, the vast majority may not be disease- 
associated. Given a genetic map based on the occurrence of such SNPs, individuals can 

35 be grouped into genetic categories depending on a particular pattern of SNPs in their 
individual genome. In such a manner, treatment regimens can be tailored to groups of 
genetically similar individuals, taking into account traits that may be common among 
such genetically similar individuals. 
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Alternatively, a method termed the "candidate gene approach", can be utilized to 
identify genes that predict drug response. According to this method, if a gene that 
encodes a drugs target is known {e.g., a TLCC-4 or TLCC-5 polypeptide of the present 
invention), all common variants of that gene can be fairly easily identified in the 
5 population and it can be determined if having one version of the gene versus another is 
associated with a particular drug response. 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a 
major determinant of both the intensity and duration of drug action. The discovery of 
genetic polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 (NAT 

10 2) and cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an explanation 
as to why some patients do not obtain the expected drug effects or show exaggerated 
drug response and serious toxicity after taking the standard and safe dose of a drug. 
These polymorphisms are expressed in two phenotypes in the population, the extensive 
metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is different among 

15 different populations. For example, the gene coding for CYP2D6 is highly polymorphic 
and several mutations have been identified in PM, which all lead to the absence of 
functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C 19 quite frequently 
experience exaggerated drug response and side effects when they receive standard 
doses. If a metabolite is the active therapeutic moiety, PM show no therapeutic 

20 response, as demonstrated for the analgesic effect of codeine mediated by its CYP2D6- 
formed metabolite morphine. The other extreme are the so called ultra-rapid 
metabolizers who do not respond to standard doses. Recently, the molecular basis of 
ultra-rapid metabolism has been identified to be due to C YP2D6 gene amplification. 

Alternatively, a method termed the "gene expression profiling", can be utilized to 

25 identify genes that predict drug response. For example, the gene expression of an 

animal dosed with a drug (e.g., a TLCC-4 or TLCC-5 molecule or TLCC-4 or TLCC-5 
modulator of the present invention) can give an indication whether gene pathways 
related to toxicity have been turned on. 

Information generated from more than one of the above pharmacogenomics 

30 approaches can be used to determine appropriate dosage and treatment regimens for 
prophylactic or therapeutic treatment an individual. This knowledge, when applied to 
dosing or drug selection, can avoid adverse reactions or therapeutic failure and thus 
enhance therapeutic or prophylactic efficiency when treating a subject with a TLCC-4 or 
TLCC-5 molecule or TLCC-4 or TLCC-5 modulator, such as a modulator identified by 

35 one of the exemplary screening assays described herein. 
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4. Use of TLCC-4 and TLCC-5 Molecules as Surrogate Markers 

The TLCC-4 or TLCC-5 molecules of the invention are also useful as markers of 
disorders or disease states, as markers for precursors of disease states, as markers for 
predisposition of disease states, as markers of drug activity, or as markers of the 
5 pharmaco genomic profile of a subject. Using the methods described herein, the 

presence, absence and/or quantity of the TLCC-4 or TLCC-5 molecules of the invention 
may be detected, and may be correlated with one or more biological states in vivo. For 
example, the TLCC-4 or TLCC-5 molecules of the invention may serve as surrogate 
markers for one or more disorders or disease states or for conditions leading up to 

10 disease states. As used herein, a "surrogate marker" is an objective biochemical marker 
which correlates with the absence or presence of a disease or disorder, or with the 
progression of a disease or disorder (e.g. , with the presence or absence of a tumor). The 
presence or quantity of such markers is independent of the disease. Therefore, these 
markers may serve to indicate whether a particular course of treatment is effective in 

1 5 lessening a disease state or disorder. Surrogate markers are of particular use when the 
presence or extent of a disease state or disorder is difficult to assess through standard 
methodologies (e.g., early stage tumors), or when an assessment of disease progression 
is desired before a potentially dangerous clinical endpoint is reached (e.g., an assessment 
of cardiovascular disease may be made using cholesterol levels as a surrogate marker, 

20 and an analysis of HIV infection may be made using HIV RNA levels as a surrogate 

marker, well in advance of the undesirable clinical outcomes of myocardial infarction or 
fully-developed AIDS). Examples of the use of surrogate markers in the art include: 
Koomen et al (2000) J. Mass. Spectrom. 35: 258-264; and James (1994) AIDS 
Treatment News Archive 209. 

25 The TLCC-4 or TLCC-5 molecules of the invention are also useful as 

pharmacodynamic markers. As used herein, a "pharmacodynamic marker" is an 
objective biochemical marker which correlates specifically with drug effects. The' 
presence or quantity of a pharmacodynamic marker is not related to the disease state or 
disorder for which the drug is being administered; therefore, the presence or quantity of 

30 the marker is indicative of the presence or activity of the drug in a subject. For example, 
a pharmacodynamic marker may be indicative of the concentration of the drug in a 
biological tissue, in that the marker is either expressed or transcribed or not expressed or 
transcribed in that tissue in relationship to the level of the drug. In this fashion, the 
distribution or uptake of the drug may be monitored by the pharmacodynamic marker. 

35 Similarly, the presence or quantity of the pharmacodynamic marker may be related to 
the presence or quantity of the metabolic product of a drug, such that the presence or 
quantity of the marker is indicative of the relative breakdown rate of the drug in vivo. 
Pharmacodynamic markers are of particular use in increasing the sensitivity of detection 
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of drug effects, particularly when the drug is administered in low doses. Since even a 
small amount of a drug may be sufficient to activate multiple rounds of marker (e.g., a 
TLCC-4 or TLCC-5 marker) transcription or expression, the amplified marker may be in 
a quantity which is more readily detectable than the drug itself. Also, the marker may 
5 be more easily detected due to the nature of the marker itself; for example, using the 
methods described herein, anti-TLCC-4 or anti-TLCC-5 antibodies may be employed in 
an immune-based detection system for a TLCC-4 or TLCC-5 protein marker, or TLCC- 
4-specific or TLCC-5-specific radiolabeled probes may be used to detect a TLCC-4 or 
TLCC-5 mRNA marker. Furthermore, the use of a pharmacodynamic marker may offer 

10 mechanism-based prediction of risk due to drug treatment beyond the range of possible 
direct observations. Examples of the use of pharmacodynamic markers in the art 
include: Matsuda et al US 6,033,862; Hattis et al (1991) Env. Health Perspect. 90: 
229-238; Schentag (1999) Am. J. Health-Syst. Pharm. 56 Suppl. 3: S21-S24; and 
Nicolau (1999) Am, J. Health-Syst Pharm. 56 Suppl. 3: S16-S20. 

15 The TLCC-4 or TLCC-5 molecules of the invention are also useful as pharmacogenomic 
markers. As used herein, a "pharmacogenomic marker" is an objective biochemical 
marker which correlates with a specific clinical drug response or susceptibility in a 
subject (see, e.g., McLeod et al (1999) Eur. J. Cancer 35(12): 1650-1652). The 
presence or quantity of the pharmacogenomic marker is related to the predicted response 

20 of the subject to a specific drug or class of drugs prior to administration of the drug. By 
assessing the presence or quantity of one or more pharmacogenomic markers in a 
subject, a drug therapy which is most appropriate for the subject, or which is predicted 
to have a greater degree of success, may be selected. For example, based on the 
presence or quantity of RNA, or protein (e.g., TLCC-4 or TLCC-5 protein or RNA) for 

25 specific tumor markers in a subject, a drug or course of treatment may be selected that 
is optimized for the treatment of the specific tumor likely to be present in the subject. 
Similarly, the presence or absence of a specific sequence mutation in TLCC-4 or TLCC- 
5 DNA may correlate TLCC-4 or TLCC-5 drug response. The use of pharmacogenomic 
markers therefore permits the application of the most appropriate treatment for each 

30 subject without having to administer the therapy. 

This invention is further illustrated by the following examples which should not 
be construed as limiting. The contents of all references, patents and published patent 
applications cited throughout this application, as well as the Figures and the Sequence 
35 Listings, are incorporated herein by reference. 
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EXAMPLES 

EXAMPLE 1 : IDENTIFICATION AND CHARACTERIZATION OF HUMAN 

5 

TLCC-4 AND TLCC-5 cDNA 

In this example, the identification and characterization of the genes encoding 
human TLCC-4 (clone Fbh48000) and TLCC-5 (clone Fbh52920c) is described. 

10 Isolation of the TLCC-4 and TLCC-5 cDNA 

The invention is based, at least in part, on the discovery of human genes 
encoding novel polypeptides, referred to herein as TLCC-4 and TLCC-5. The entire 
sequence of the human clone Fbh48000 was determined and found to contain an open 
reading frame termed human "TLCC-4." The nucleotide sequence of the human TLCC- 

15 4 gene is set forth in Figure 1 and in the Sequence Listing as SEQ ID NO:l. The amino 
acid sequence of the human TLCC-4 expression product is set forth in Figure 1 and in 
the Sequence Listing as SEQ ID NO:2. The TLCC-4 polypeptide encoded by this 
nucleic acid comprises about 751 amino acids. Clone Fbh48000, comprising the coding 
region of human TLCC-4, was deposited with the American Type Culture Collection 

20 (ATCC®), 10801 University Boulevard, Manassas, VA 20 1 1 0-2209, on , and 

assigned Accession No. . 

The entire sequence of the human clone Fbh52920c was determined and found to 
contain an open reading frame termed human "TLCC-5." The nucleotide sequence of 
the human TLCC-5 gene is set forth in Figure 2 and in the Sequence Listing as SEQ ID 

25 NO:4. The amino acid sequence of the human TLCC-4 expression product is set forth 
in Figure 2 and in the Sequence Listing as SEQ ID NO:5. The TLCC-5 polypeptide 
comprises about 1013 amino acids. Clone Fbh48000, comprising the coding region of 
human TLCC-5, was deposited with the American Type Culture Collection (ATCC®), 
10801 University Boulevard, Manassas, VA 201 10-2209, on , and assigned 

30 Accession No. 



Analysis of the Human TLCC-4 and TLCC-5 Molecules 

The human TLCC-5 amino acid sequence was aligned with the amino acid 
sequence of transient receptor potential polypeptide 7 (TRP7) and melastatin from 
35 Homo sapiens using the CLUSTAL W (1.74) multiple sequence alignment program. 
The results of the alignments are set forth in Figures 10 and 11. 
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A search was performed against the HMM database in PFAM (Figure 5) 
resulting in the identification of four ankyrin repeat domains at about residues 167-202 
(score = 1.6), 214-246 (score = 30.6), 261-294 (score = 27.9), and 340-372 (score = 
18.6), and an ion transport protein domain at about residues 510-677 (score = 34.5) in 
5 the amino acid sequence of human TLCC-4 (SEQ ID NO:2). 

A search was also performed against the MEMSAT database resulting in the 
identification of six transmembrane domains in the amino acid sequence of human 
TLCC-4 (SEQ ID NO:2) at about residues 440-461, 488-508, 520-540, 547-565, 590- 
609, and 652-676 (Figure 6). 
10 A search was further performed against the HMM database resulting in the 

identification of two transient receptor domains at about residues 720-778 (score = 21.7) 
and 820-876 (score = 1.5), in the amino acid sequence of human TLCC-5 (SEQ ID 
NO:5) (Figure 8). 

A search was also performed against the MEMSAT database resulting in the 

15 identification of two transmembrane domains in the amino acid sequence of human 
TLCC-5 (SEQ ID NO:5) at about residues 786-803 and 826-848 (Figure 9). 

A search in the Prosite database further resulted in the identification of several 
protein kinase C phosphorylation sites in the amino acid sequence of human TLCC-4 
(SEQ ID NO:2) at about residues 37-39, 167-169, 290-292, 335-337, 374-376, 476-478, 

20 498-500, and 688-690; several N-glycosylation sites in the amino acid sequence of 
human TLCC-4 (SEQ ID NO:2) at about residues 452-455 and 683-686; a cAMP- and 
cGMP -dependent protein kinase phosphorylation site in the amino acid sequence of . 
human TLCC-4 (SEQ ID NO:2) at about residues 375-378; several casein kinase II 
phosphorylation sites in the amino acid sequence of human TLCC-4 (SEQ ID NO:2) at 

25 about residues 88-91, 163-166, 290-293, 305-308, 312-315, 388-391, 393-396, 397-400, 
402-405, 41 1-414, 498-501, 607-610, 624-627, and 699-702; several tyrosine kinase 
phosphorylation sites in the amino acid sequence of human TLCC-4 (SEQ ID NO:2) at 
about residues 253-260, 375-382, and 614-622; several N-myristoylation sites in the 
amino acid sequence of human TLCC-4 (SEQ ID NO:2) at about residues 238-243 and 

30 602-607; an amidation site in the amino acid sequence of human TLCC-4 (SEQ ID 
NO:2) at about residues 12-15; and a leucine zipper site in the amino acid sequence of 
human TLCC-4 (SEQ ID NO:2) at about residues 584-605. 

A search performed in the Prosite database further resulted in the identification 
of several protein kinase C phosphorylation sites in the amino acid sequence of human 

35 TLCC-5 (SEQ ID NO:5) at about residues 21-23, 28-30, 39-41, 105-107, 240-242, 305- 
307, 331-333, 338-340, 71 1-713, 802-804, 901-903, 972-974, and 1001-1003; several 
casein kinase II phosphorylation sites in the amino acid sequence of human TLCC-5 
(SEQ ID NO:5) at about residues 54-57, 143-146, 223-226, 240-243, 308-311, 360-363, 
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436-439, 487-490, 576-579, 725-728, 977-980, and 982-985; and several tyrosine kinase 
phosphorylation sites in the amino acid sequence of human TLCC-5 (SEQ ID NO:5) at 
about residues 49-55, 247-254, and 307-314. 

Further domain motifs were identified by using the amino acid sequence of 
5 TLCC-4 (SEQ ID NO:2) to search through the ProDom database. Numerous matches 
against protein domains described as "receptor vanilloid channel activated receptor- 
related receptor-like type OTRPC4", "channel vanilloid receptor activated receptor- 
related receptor-like OTRPC4 2B ion", "repeat ankyrin kinase nuclear factor channel", 
"ankyrin repeat kinase domain UNC-44 alternative glycoprotein EGF-like", "ankyrin", 

10 "channel osmotically receptor-related vanilloid cation", "receptor vanilloid channel 
activated receptor-related receptor-like calcium type", "calcium epithelial channel 
transporter homdlog CAT2", "channel protein receptor calcium transient potential 
transmembrane ion transport", and "receptor vanilloid channel activated osmotically", 
and the like were identified. 

15 Further domain motifs were identified by using the amino acid sequence of 

TLCC-5 (SEQ ID NO:5) to search through the ProDom database. Numerous matches 
against protein domains described as "channel protein calcium entry capacitative ionic 
transmembrane ion transport transient" and the like were identified. 

20 Tissue Distribution of TLCC-4 and TLCC-5 mRNA 

This example describes the tissue distribution of TLCC-4 mRNA, as determined 
by RT-PCR, and the tissue distribution of TLCC-4 and TLCC-5 mRNA as may be 
determined by Northern blot analysis. 

Various cDNA libraries were analyzed by RT-PCR using a human TLCC-4- 
25 specific probe. From this analysis it was determined that TLCC-4 mRNA was 

expressed predominantly in the hypothalamus and skin. TLCC-4 mRNA was found in 
moderate levels in adipose and teste, and in low levels in skeletal muscle and brain (see 
Figure 3). 

Northern blot hybridizations with the various RNA samples is performed under 
30 standard conditions and washed under stringent conditions, i.e., 0.2XSSC at 65°C. The 
DNA probe is radioactively labeled with 32p_dCTP (using the Prime-It kit (Stratagene, 
La Jolla, CA) according to the instructions of the supplier). Filters containing human 
tissue mRNA (MultiTissue Northern I and MultiTissue Northern II from Clontech, Palo 
Alto, CA) are probed in ExpressHyb hybridization solution (Clontech) and washed at 
35 high stringency according to manufacturer's recommendations. 

In situ hybridization experiments were performed using a human TLCC-4- 
specific probe indicating TLCC-4 expression in monkey brain (cortex, thalamus, 
caudate, and hippocampus), spinal cord, DRG and SRG neurons, and in hair follicles. In 
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situ hybridization with rat pain models indicated that TLCC-4 mRNA was down- 
regulated after chronic constriction injury, which causes persistent, spontaneous firing of 
neurons and results in pain. TLCC-4 mRNA was also down-regulated after treatment 
with clofibric acid, a selective muscle toxin which produces muscle pain and 
5 inflammation. 

EXAMPLE 2: EXPRESSION OF RECOMBINANT TLCC-4 AND TLCC-5 

POLYPEPTIDES IN BACTERIAL CELLS 

10 In this example, human TLCC-4 or human TLCC-5 is expressed as a 

recombinant glutathione-S-transferase (GST) fusion polypeptide in E. coli and the 
fusion polypeptide is isolated and characterized. Specifically, TLCC-4 or TLCC-5 is 
fused to GST and this fusion polypeptide is expressed in E. colU e.g^ strain PEB199. 
Expression of the GST-TLCC-4 or GST-TLCC-5 fusion protein in PEB199 is induced 

] 5 with IPTG. The recombinant fusion polypeptide is purified from crude bacterial lysates 
of the induced PEB199 strain by affinity chromatography on glutathione beads. Using 
polyacrylamide gel electrophoretic analysis of the polypeptide purified from the 
bacterial lysates, the molecular weight of the resultant fusion polypeptide is determined. 

20 EXAMPLE 3: EXPRESSION OF RECOMBINANT IC54420 

POLYPEPTIDE IN COS CELLS 

To express the human TLCC-4 or human TLCC-5 gene in COS cells, the 
pcDNA/Amp vector by Invitrogen Corporation (San Diego, CA) is used. This vector 

25 contains an SV40 origin of replication, an ampicillin resistance gene, an E. coli 

replication origin, a CMV promoter followed by a polylinker region, and an S V40 intron 
and polyadenylation site. A DNA fragment encoding the entire TLCC-4 or TLCC-5 
polypeptide and an HA tag (Wilson et al (1984) Cell 37:767) or a FLAG tag fused in- 
frame to its 3' end of the fragment is cloned into the polylinker region of the vector, 

30 thereby, placing the expression of the recombinant polypeptide under the control of the 
CMV promoter. 

To construct the plasmid, the human TLCC-4 or human TLCC-5 DNA sequence 
is amplified by PCR using two primers. The 5' primer contains the restriction site of 
interest followed by approximately twenty nucleotides of the TLCC-4 or TLCC-5 
35 coding sequence starting from the initiation codon; the 3 1 end sequence contains 
complementary sequences to the other restriction site of interest, a translation stop 
codon, the HA tag or FLAG tag and the last 20 nucleotides of the TLCC-4 or TLCC-5 
coding sequence. The PCR amplified fragment and the pCDNA/Amp vector are 
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digested with the appropriate restriction enzymes and the vector is dephosphorylated 
using the CIAP enzyme (New Engiand Biolabs, Beverly, MA). Preferably the two 
restriction sites chosen are different so that the TLCC-4 or TLCC-5 gene is inserted in 
the correct orientation. The ligation mixture is transformed into E. coli cells (strains 
5 HB101, DH5a 3 SURE, available from Stratagene Cloning Systems, La Jolla, CA, can be 
used), the transformed culture is plated on ampicillin media plates, and resistant colonies 
are selected. Plasmid DNA is isolated from transformants and examined by restriction 
analysis for the presence of the correct fragment. 

COS cells are subsequently transfected with the human TLCC-4- pcDNA/Amp 

10 plasmid DNA or human TLCC-5-pcDNA/Amp plasmid DNA using the calcium 
phosphate or calcium chloride co-precipitation methods, DEAE-dextran-mediated 
transfection, lipofectibn, or electroporation. Other suitable methods for transfecting host 
cells can be found in Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular Cloning: A 
Laboratory Manual 2nd, ed, Cold Spring Harbor Laboratory, Cold Spring Harbor 

15 Laboratory Press, Cold Spring Harbor, NY, 1989. The expression of the IC54420 
polypeptide is detected by radiolabelling (35s-methionine or 35s- C ysteine available 
from NEN, Boston, MA, can be used) and immunoprecipitation (Harlow, E. and Lane, 
D. Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY, 1988) using an HA specific monoclonal antibody. Briefly, the cells 

20 are labelled for 8 hours with 35 S-methionine (or 35 S-cysteine). The culture media are ■. 
then collected and the cells are lysed using detergents (RIP A buffer, 150 mM NaCl, 1% 
NP-40, 0.1% SDS, 0.5% DOC, 50 mM Tris, pH 7.5). Both the cell lysate and the 
culture media are precipitated with an HA specific monoclonal antibody. Precipitated - 
polypeptides are then analyzed by SDS-PAGE. 

25 Alternatively, DNA containing the human TLCC-4 or human TLCC-5 coding 

sequence is cloned directly into the polylinker of the pCDNA/Amp vector using the 
appropriate restriction sites. The resulting plasmid is transfected into COS cells in the 
manner described above, and the expression of the TLCC-4 or TLCC-5 polypeptide is 
detected by radiolabelling and immunoprecipitation using a TLCC-4-specific or TLCC- 

30 5-specific monoclonal antibody. 
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EXAMPLE 4: TISSUE DISTRIBUTION OF HUMAN TLCC-4 mRNA 
USING TAQMAN™ ANALYSIS 

5 This example describes the tissue distribution of human TLCC-4 mRNA in a 

variety of cells and tissues, as determined using the TaqMan™ procedure. The 
Taqman™ procedure is a quantitative, reverse transcription PCR-based approach for 
detecting mRNA. The RT-PCR reaction exploits the 5 5 nuclease activity of AmpliTaq 
Gold™ DNA Polymerase to cleave a TaqMan™ probe during PCR. Briefly, cDNA was 

10 generated from the samples of interest, e.g. , various human tissue samples, and used as 
the starting material for PCR amplification. In addition to the 5' and 3' gene-specific 
primers, a gene-specific oligonucleotide probe (complementary to the region being 
amplified) was included in the reaction (i.e., the Taqman™ probe). The TaqMan™ 
probe includes the oligonucleotide with a fluorescent reporter dye covalently linked to 

15 the 5 5 end of the probe (such as FAM (6-carboxyfluorescein), TET (6-carboxy-4,7,2\7'- 
tetrachlorofluorescein), JOE (6-carboxy-4,5-dicWoro-2,7-dimethoxyfluorescein), or 
VIC) and a quencher dye (TAMRA (6-carboxy-N,N,N\NMetramethylrhodamine) at the 
3' end of the probe. 

During the PCR reaction, cleavage of the probe separates the reporter dye and 

20 the quencher dye, resulting in increased fluorescence of the reporter. Accumulation of • 
PCR products is detected directly by monitoring the increase in fluorescence of the 
reporter dye. When the probe is intact, the proximity of the reporter dye to the quencher 
dye results in suppression of the reporter fluorescence. During PCR, if the target of 
interest is present, the probe specifically anneals between the forward and reverse primer 

25 sites. The 5 '-3' nucleolytic activity of the AmpliTaq™ Gold DNA Polymerase cleaves 
the probe between the reporter and the quencher only if the probe hybridizes to the 
target. The probe fragments are then displaced from the target, and polymerization of 
the strand continues. The 3' end of the probe is blocked to prevent extension of the 
probe during PCR. This process occurs in every cycle and does not interfere with the 

30 exponential accumulation of product. RNA was prepared using the trizol method and 
treated with DNase to remove contaminating genomic DNA. cDNA was synthesized 
using standard techniques. Mock cDNA synthesis in the absence of reverse 
transcriptase resulted in samples with no detectable PCR amplification of the control 
gene confirms efficient removal of genomic DNA contamination. 

35 As indicated in Figure 3, strong expression of TLCC-4 was detected in human 

brain (hypothalamus) and skin tissues. In addition, TLCC-4 expression was detected at 
moderate levels in adipose and testis tissues, and at low levels in the fetal heart, skeletal 
muscle, brain, and colon tissues. Pain human panel phase I and MP Phase 1 .3.3 libraries 
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were also analyzed and it was determined that TLCC-4 was expressed at high levels in 
the brain, cortex, and testis, at moderate levels in the spinal cord, dorsal root ganglion 
(DRG), and the hypothalamus, and at low levels in the skin, placenta, small intestine, 
ovary, prostate epithelial cells, liver, skin (decubitus), colon tumor cells, and breast 
5 tumor cells (see Figures 12 and 13). Monkey libraries were also analyzed indicating that 
TLCC-4 was expressed at high levels in the monkey cortex and hairy skin, and at low 
levels in the monkey spinal cord (see Figures 14 and 15). Metabolic libraries were also 
analyzed demonstrating that TLCC-4 was expressed at high levels in adipose and brain 
tissues, and at low levels in differentiated adipocytes and pre-adipocytes, as well as in 
10 the hypothalamus, colon, small intestine, skeletal muscle, and liver tissues 
(see Figure 16). 

EXAMPLE 5: REGULATION OF CALCIUM INFLUX THROUGH TLCC-4 

15 This experiment describes the regulation of calcium influx though TLCC-4 in 

911 cells as determined by Fluorometric Imaging Plate Reader experiments (FLIPR) 
(Molecular Devices Corp., Sunnyvale, CA). 

The FLIPR is a screening tool for cell-based fluorescent assays which allows the 
simultaneous stimulation and measurement of separate cell populations in a high 

20 throughput format. Therefore, using this system, it is possible to quantify transient 

signals, such as the release of intracellular calcium, from cell populations, in parallel and 
in real time. The FLIPR contains chambers in which to hold the test plate and plates 
containing antagonists or agonists to be added to the test plate. - The FLIPR utilizes an 
argon laser that provides discrete spectral lines spaced from approximately 350 to 530 

25 nm. For use with fluorescent Ca 2+ dyes, the 88-nm line of the laser is employed. The 
laser simultaneously illuminates the wells in a test plate. The image of each well in the 
plate is captured by a cooled charge coupled device (CCD) camera, which updates 
images once per second, if required, for the measurement of rapid calcium responses. 
Because both excitation and emission are read via the bottom of the plate, black-walled, 

30 transparent bottomed 96-well plates are used. Data captured by the CCD camera is 
converted to digital data and then transferred to a computer. 

Briefly, a calcium indicator (e.g., fluo-3/AM or Calcium Green-l/AM) was 
transferred to the culture medium. Because the FLIPR collects fluorescence from the 
bottom of the well, suspension cells require centrifugation to the base of the well 

35 following dye loading. Viable 911 cells were resuspended in loading medium and 
incubated for one hour. The cells were then centrifuged and resuspended with wash 
buffer. The cell suspension containing the dye was then aliquoted into each well of the 
black-walled, transparent bottomed 96-well plate and the plate was centrifuged. The 
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FLEPR assay was then carried out and the results analyzed. (If adherent cells are used, 
they may be plated at an appropriate density in the 96-well plates and cultured 
overnight. Dye may then be loaded and incubated). 

Results show a constitutive calcium influx through TLCC-4 in 91 1 cells that 
5 were incubated with NMDG/0 Ca 2+ and stimulated afterwards with 5 mM Ca 2+ . 

Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
10 described herein. Such equivalents are intended to be encompassed by the following 
claims. 



NSDOCID: <WO 0200722A2_1_> 
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What is claimed: 

1 . An isolated nucleic acid molecule selected from the group consisting of: 

5 

(a) a nucleic acid molecule comprising the nucleotide sequence set forth 
inSEQIDNO:l;and 

(b) a nucleic acid molecule comprising the nucleotide sequence set forth 
in SEQ ID NO:3; and 

10 (c) a nucleic acid molecule comprising the nucleotide sequence set forth 

in SEQ ID NO:4; and 

(d) a nucleic acid molecule comprising the nucleotide sequence set forth 
in SEQ ID NO:6. 

15 2. An isolated nucleic acid molecule selected from the group which 

encodes: 

(a) a polypeptide comprising the amino acid sequence set forth in SEQ 

ID NO:2; and 

20 (b) a polypeptide comprising the amino acid sequence set forth in SEQ 

ID NO:5. 

3. Isolated nucleic acid molecules comprising the nucleotide sequences 
contained in the plasmids deposited with ATCC® as Accession Numbers . 

25 

4. An isolated nucleic acid molecule which encodes a naturally occurring 
allelic variant of a polypeptide comprising the amino acid sequence set forth in SEQ ID 
NO:2 or 5. 

30 5. An isolated nucleic acid molecule selected from the group consisting of: 

a) a nucleic acid molecule comprising a nucleotide sequence which 
is at least 60% identical to the nucleotide sequence of SEQ ID NO: I, 3, 4, or 6, or a 
complement thereof; 

35 b) a nucleic acid molecule comprising a fragment of at least 30 

nucleotides of a nucleic acid comprising the nucleotide sequence of SEQ ID NO: 1,3,4, 
or 6, or a complement thereof; 
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c) a nucleic acid molecule which encodes a polypeptide comprising 
an amino acid sequence at least about 60% identical to the amino acid sequence of SEQ 
ID NO:2 or 5; and 

d) a nucleic acid molecule which encodes a fragment of a 

5 polypeptide comprising the amino acid sequence of SEQ ID NO:2 or 5, wherein the 
fragment comprises at least 10 contiguous amino acid residues of the amino acid 
sequence of SEQ ID NO:2 or 5. 

6. An isolated nucleic acid molecule which hybridizes to a complement of 
10 the nucleic acid molecule of any one of claims 1, 2, 3, 4, or 5 under stringent conditions. 



7. An isolated nucleic acid molecule comprising a nucleotide sequence 
which is complementary to the nucleotide sequence of the nucleic acid molecule of any 
one of claims 1, 2, 3, 4, or 5. 

8. An isolated nucleic acid molecule comprising the nucleic acid molecule 
of any one of claims 1, 2, 3, 4, or 5, and a nucleotide sequence encoding a heterologous 
polypeptide. 



20 9. A vector comprising the nucleic acid molecule of any one of claims 1, 2, 

3, 4, or 5. 

10. The vector of claim 9, which is an expression vector. 

25 1 1 . A host cell transfected with the expression vector of claim 1 0. 



12. A method of producing a polypeptide comprising culturing the host cell 
of claim 1 1 in an appropriate culture medium to, thereby, produce the polypeptide. 

30 13. An isolated polypeptide selected from the group consisting of: 

a) a fragment of a polypeptide comprising the amino acid sequence 
of SEQ ID NO:2 or 5, wherein the fragment comprises at least 10 contiguous amino 
acids of SEQ ID NO:2 or 5; 
35 b) a naturally occurring allelic variant of a polypeptide comprising 

the amino acid sequence of SEQ ID NO:2 or 5, wherein the polypeptide is encoded by a 
nucleic acid molecule which hybridizes to a complement of a nucleic acid molecule 
consisting of SEQ ID NO:l, 3, 4, or 6 under stringent conditions; 
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c) a polypeptide which is encoded by a nucleic acid molecule 
comprising a nucleotide sequence which is at least 60% identical to a nucleic acid 
comprising the nucleotide sequence of SEQ ID NO:l, 3, 4, or 6; and 

d) a polypeptide comprising an amino acid sequence which is at 
5 least 60% identical to the amino acid sequence of SEQ ID NO:2 or 5. 

14. The isolated polypeptide of claim 13 comprising the amino acid sequence 
of SEQ ID NO:2 or 5. 

10 15. The polypeptide of claim 13, further comprising heterologous amino acid 

sequences. 

16. An antibody which selectively binds to a polypeptide of claim 13. 

15 1 7. A method for detecting the presence of a polypeptide of claim 1 3 in a 

sample comprising: 

a) contacting the sample with a compound which selectively binds to 
the polypeptide; and 

b) determining whether the compound binds to the polypeptide in 
20 the sample to thereby detect the presence of a polypeptide of claim 13 in the sample. 

18. The method of claim 17, wherein the compound which binds to the 
polypeptide is an antibody. 

25 19. A kit comprising a compound which selectively binds to a polypeptide of 

claim 13 and instructions for use. 

20. A method for detecting the presence of a nucleic acid molecule of any 
one of claims 1, 2, 3, 4, or 5 in a sample comprising: 

30 a) contacting the sample with a nucleic acid probe or primer which 

selectively hybridizes to a complement of the nucleic acid molecule; and 

b) determining whether the nucleic acid probe or primer binds to the 
complement of the nucleic acid molecule in the sample to thereby detect the presence of 
the nucleic acid molecule of any one of claims 1, 2, 3, 4, or 5 in the sample. 

35 

21 . The method of claim 20, wherein the sample comprises mRNA 
molecules and is contacted with a nucleic acid probe. 
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22. A kit comprising a compound which selectively hybridizes to a 
complement of the nucleic acid molecule of any one of claims 1, 2, 3 , 4, or 5 and 
instructions for use. 

5 23. A method for identifying a compound which binds to a polypeptide of 

claim 13 comprising: 

a) contacting the polypeptide, or a cell expressing the polypeptide 
with a test compound; and 

b) determining whether the polypeptide binds to the test compound. 

10 

24. The method of claim 23, wherein the binding of the test compound to the 
polypeptide is detected by a method selected from the group consisting of: 

a) detection of binding by direct detection of test 
compound/polypeptide binding; 
15 b) detection of binding using a competition binding assay; and 

c) detection of binding using an assay for TLCC-4 or TLCC-5 

activity. 

25. A method for modulating the activity of a polypeptide of claim 13 
20 comprising contacting the polypeptide or a cell expressing the polypeptide with a 

compound which binds to the polypeptide in a sufficient concentration to modulate the 
activity of the polypeptide. 

26. A method for identifying a compound which modulates the activity of a 
25 polypeptide of claim 13 comprising: 

a) contacting a polypeptide of claim 13 with a test compound; and 

b) determining the effect of the test compound on the activity of the 
polypeptide to thereby identify a compound which modulates the activity of the 
polypeptide. 

30 

27. The method of claim 26, wherein said activity is modulation of 
nociception. 

28. A method for identifying a compound which modulates pain comprising: 
35 a) contacting the polypeptide of claim 13, or a cell expressing the 

polypeptide with a test compound; and 

b) identifying the compound as a modulator of pain by determining the 
effect of the test compound on the activity of the polypeptide. 
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29. A method for identifying a compound which modulates nociception 
comprising: 

a) contacting the polypeptide of claim 13, or a cell expressing the 
5 polypeptide with a test compound; and 

b) identifying the compound as a modulator of nociception by 
determining the effect of the test compound on the activity of the polypeptide. 

30. A method for treating a subject having a pain disorder comprising 
10 administering to the subject a TLCC-4 or TLCC-5 modulator, thereby treating said 

subject having a pain disorder. 

31. A method for treating a subject having a pain disorder comprising 
administering to the subject a TLCC-4 or TLCC-5 modulator, wherein the TLCC-4 or 

15 TLCC-5 modulator is the modulator identified by the method of claim 26, thereby 
treating said subject having a pain disorder. 

32. The method of claim 30, wherein the TLCC-4 or TLCC-5 modulator is a 
small molecule. 

20 

33. The method of claim 30, wherein said TLCC-4 or TLCC-5 modulator is 
administered in a pharmaceutically acceptable formulation. 

34. The method of claim 30, wherein said TLCC-4 or TLCC-5 modulator is 
25 administered using a gene therapy vector. 

35. The method of 30, wherein the TLCC-4 or TLCC-5 modulator is capable 
of modulating TLCC-4 or TLCC-5 polypeptide activity. 

30 
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CGACGCGTCCGATTCCAGCCATCCCTCTGCCTGCAATGAGAGCTTCCCGCCGCCTCAGCC 

ACAGTCCCACCGGGGGGCGTTGGGCCCCAGACATGCGGTGATCTCAGGGCAAGGGTTGCC 

AGGACCACeCAGAACCTCACCAGCCATGAAAGCCCACCCCAAGGAGATGGTGCCTCTCAT 

GGGCAA.GAQAGTTGCTGCCCCCAGTGGGAACCCTGCCGTCCTGCCAGAGAAGAGGCCGGC 

G.GAGATCACCCCCACAAAGAAGAGTGCACACTTCTTCCTGGAGATAGAAGGGTTTGAACC 

CAAeCGCACAGTTGCCAAGACCTCTCCTCCTGTCTTCTCCAAGCCCATGGATTCCAACAT 

GCGQQAGTGCATCTQTGGTAACTGTGATGACATGGACTCCCCCCAGTCTCCTCAAGATGA 

TGTGACAGAGACCCCATCCAATCCCAACAGCCCCAGTGGACAGCTGGCCAAGGAAGAGCA 

GAGGAqGAAAAAGAGGCGGCTGAAGAAGCGCATCTTTGCAGCCGTGTCTGAGGGCTGCGT 

GGAQGAGTTGGTAGAGTTgGTGGTGGAGCTGCAGGAGCTTTGCAGGCGGCGCCATGATGA 

GGATGTGCCTGACTTCCTCATGCACAAGCTGACGGCCTCCGACACGGGGAAGACCTGCCT 

,GATGAAGGCeTTGTTAAACATCAACCCCAACACCAAGGAGATCGTGCGGATCCTGCTTGC 

CTTTGCTGAAGAGAACGACATCCTGGGCAGGTTCATCAACGCCGAGTACACAGAGGAGGC 

CTATGAAGGGCAGAGGGCGCTGAACATCGCCATCGAGCGGCGGCAGGGGGACATCGCAGC 

CCTGCTjCATCGCCGCCGGCGCCGACGTCAACGCGCACGCCAAGGGGGCCTTCTTCAACCC 

CAAGTACCAACACGAAGGGTTCTACTTCGGTGAGACGCCCCTGGCCCTGGCAGCATGCAC 

CAACCAGCCCGAGATTGTGCAGCTGCTGATGGAGCACGAGCAGACGGACATCACCTCGCG 

GGACTCACGAGGCAACAACATCCTTCACGCCCTGGTGACCGTGGCCGAGGACTTCAAGAC 

GCAGAATGAGTTTGTGAAGGGCATGTACGACATGATCCTACTGCGGAGTGGCAACTGGGA 

GCTGGAGACCACTCGCAACAACGATGGCCTCACGCCGCTGCAGCTGGGCGGCAAGATGGG 

CAAGGCGGAGATCCTGAAGTACATCCTCAGTCGTGAGATCAAGGAGAAGCGGCTCCGGAG 

CCTGTCCAGGAAGTTCACGGACTGGGCGTACGGACCCGTGTCATCCTCCCTCTACGACCT 

CAGCAACGTGGACACCACCACGGACAACTCAGTGCTGGAAATCACTGTCTACAACACCAA 

CATCGACAACCGGCATGAGATGCTGACCCTGGAGCCGCTGCACACGCTGCTGCATATGAA 

GTGGAAGAAGTTTGCCAAGCACATGTTCTTTCTGTCCTTCTGCTT7TATTTCTTCTACAA 

CATCACCCTGACCCTCGTCTCGTACTACCGCCCCCGGGAGGAGGAGGCCATCCCGCACCC 

CTTGGCCCTGACGCACAAGATGGGGTGGCTGCAGCTCCTAGGGAGGATGTTTGTGCTCAT 

CTGGGCCATGTGCATCTCTGTGAAAGAGGGCATTGCCATCTTCCTGCTGAGACCCTCGGA 

TCTGCAGTCCATCCTCTCGGATGCCTGGTTCCACTTTGTCTTTTTTATCCAAGCTGTGCT 

TGTGATACTGTCTGTCTTCTTGTACTTGTTTGCCTACAAAGAGTACCTCGCCTGCCTCGT 

GCTGGCCATGGCCCTGGGCTGGGCGAACATGCTCTACTATACGCGGGGTTTCCAGTCCAT 

GGGCATGTACAGCGTCATGATCCAGAAGGTCATTTTGCATGATGTTCTGAAGTTCTTGTT 

TGTATATATCGTGTTTTTGCTTGGATTTGGAGTAGCCTTGGCCTCGCTGATCGAGAAGTG 

TCCCAAAGACAACAAGGACTGCAGCTCCTACGGCAGCTTCAGTGACGCAGTGCTGGAACT 

CTTCAAGCTCACCATAGGCCTGGGTGATCTGAACATCCAGCAGAACTCCAAGTATCCCAT 

TCTCTTTCTGTTCCTGCTCATCACCTATGTCATCCTCACCTTTGTTCTCCTCCTCAACAT 

GCTCATTGCTCTGATGGGCGAGACTGTGGAGAACGTCTCCAAGGAGAGCGAACGCATCTG 

GCGCCTGCAGAGAGCCAGGACCATCTTGGAGTTTGAGAAAATGTTACCAGAATGGCTGAG 

GAGCAGATTCCGGATGGGAGAGCTGTGCAAAGTGGCCGAGGATGATTTCCGACTGTGTTT 

GCGGATCAATGAGGTGAAGTGGACTGAATGGAAGACGCACGTCTCCTTCCTTAACGAAGA 

CCCGGGGCCTGTAAGACGAACAGATTTCAACAAAATCCAAGATTCTTCCAGGAACAACAG 

CAAAACCACTCTCAATGCATTTGAAGAAGTCGAGGAATTCCCGGAAACCTCGGTGTAGAA 

GCGGAACCCAGAGCTGGTGTGCGCGTGCGCTGTCTGGCGCTGCAGGCGGAGTCACCGACT 

CTGTGCAGAGAGGCTTTGAGGGATGATGGAGTCCGGCTCTGCTGGCCTAGAAGCAGAGTG 

CACCCTCGTGCTCAGTGCTCAGTGGGTGTCTGAACTGAGGGGCAGTTGTCAATTTGTCTG 

AGTGGGAAACATCCTGGATTTTGTTACTTGGCAAACAGCTGGTGTAAACCTACAGCCAGC 

AGCAGTCTGGAGCCTGGGAGCCTCCTGAAGTCCCGGGTPAAGCCTCTGGTTTTACCAATT 

GCAGGTCGGCJTGGCTGGGAGAGATGGATGGCGGGAAAGGGGtAGCAGTCTTGAGGAGCA 

GGGAGAGGAGTCTTTCCTCCTGCCAGCTTCCCCCGTCAGCCCCAACCCCAGCCCACACAT 

TGTACCATCTCTTCTGCTGTGACTGGGTTGCCTGAATTTGTGGGAGACCCGTGATCCCAT 

CCCAGAGTGTGCGGGGGACGGAGGTAAGCTGGATATCCTGGGGGAGGAGGGGAATGCGCT 

CTGGAAACACCCTTCCGGAACCCTTCGGGGAAAAGGAGACCATCCTTGGAGTGAACGTCC 

CCTGACACCCCAAGGTTCAAACTGTCTCAAGCTGAGAGATGTTTTTAGTAGCATAATTAA 

CAC^GGGTTTTAACTTGCAATACGGAAAAGACATTTCAGTTGAGAATGAAAATTACTACA 

ATGAAGTTTGTGATTTTAAAAGTGGAGACAGACTGGGGGCTTTGGGGCTGGATGTAAGTA 
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TTATATATTTGGCCTCAGGGTGCCCAGAGCAAGACAAAAAGCTTTTCTTCACACACACAA 

AAGTCTGCATGAGACACTCCGGGCAAGTCCTGCTGGGCCGCCGCGATCTGGGTGAAAGGT 

CCTGGCTCTTTTCCTCGTCCTGACCTCACAGTAGCGCATGCCTGTGTGCTGGGATCGTGG 

CTTTCGCTGAAGCAGAAATAGCAGCTGCTCGATCGATATCATCTTGGAACTCAGCAGTTA 

GTCCKIATACCTCAGTACGTCTCAGTGGGGGAATTTAACAAAATGCCTCAACTGCTTTGGT 

ACGAAGTAI I 1 1 I nTTI A ATTTTA ACTGTG A ATTTTG A AGCTG A AGGGG A AGCTTGTG A 

GAGAAAAGCATTTGCCAAGACTTTGAGCTTA 11111 AGGTCCTCGTCCTCTGATGTTCTC 

TTTCTGAAATGACACGGAGTCAGTCTCGGGGGCAGAGGTGAAGTGGAGACGGAAGGATTT 

TCCAGGTGACTGGGGCCGAAACCACCAGAAAATCCACTCTGCCGCCGTTATCTGGTGAAA 

GGATTCATGTAAAAATGTTCGAGGTGGAATTATAAAAATAGTAACCATAAATGTTAATCT 

TAAATGGCAGAAATAGAAATTTGGCCTTCAGATAACATGGCGATAGATAAGTTCATCTGG 

CTTGAGGCAAACTGAAGAGTCGGGGCCTAGCATTGCACTCTGGGCCAGTTTCTCTGCCCT 

GGGCCACTCTGTGTGCCAGACTAGCTGGACAGATAGAGACTTTGTGCCCCTGATGGGGCC 

GATTGGGGAGAGGTGGGCTGGGGTGTGCAGGCTTCACAATCCACAGCAGCCCCTGCCCTC 

CCAGCTGACCCAGGGAGTAATCGCGTGCTCTAAGCCACAGTGGTCGGGGCTGGGCATGGG 

CCTCTGGAGAAGAGAAGATTTGAGGAGAACTGTCCTAGAGGCAGGAGGAGCAGATGTGTT 

TCAGAATGGGCAGAATTAGGAAATTGAGAAAGATTTTGGCTCAACAGAATCCAGCAACTG 

CTCCAGATGTTGGAGATGTTTAAGCAGAAGCTGGTTGAGCACTTAATGAGGAATGTTGTT 

GAAAATGGTCATTGGAAGAAGTTTAAGGTCCCTTTTAGCCTGGAGATTGTACAAATCAGC 

ATTCCACATCTGGAGTTAGCTACCCGCATTAAGCCTGAACAGACATCTTGGTCTGAAAGG 

AAGTGGTTTGGATTCATGATGCCAAGCTCCACACTATGGAGCTGGGAATTCCAGAATTGC 

TTTGACTCAGATATTAATGGAGAAAG 



MKAHPKBMVPLMGKRVAAPSGNPAVLPEKRPAE ITPTKKSAHFFLE I EGFE PNPTVAKTS 
PPVFSKPMDSNI RQC ISGNCDDMDS PQS PQDDVTETPSNPNS PS AQLAKEEQRRKKRRLK 
KRIFAAVSEGCVEELVELLVELQELCRRRHDEDVPDFLMHKLT^ 

PNTKE I VR I LLAFAEENDI LGRF INAE YTEE AYEGQTA1.N I AI ERRQGDI AALLI AAG AD 
VNAHAKGAFFNPKYQHEGFYFGETPLALAACTNQPEIVQLLMEHEOTDITSRDSRGNNIL 
HALVTVAEDFKTQNDFVKRMYDMI LLRSGNWELETTRNNDGLTPLQLAAKMGKAE I LKY I 
LS RE I KEKRLRS LS RKFTDWAYGP VS S S L YDLTNVDTTTDNS VLE I TVYNTN I DNRH EML 
TLEPLHTLLHMKWKKFAKHMFFLSFCFYFFYNITLTLVSYYRPREEEAIPHPLiALTHKMG 
WLQLLGRMFVLI WAMCI SVKEG I AI FLLRPSDLQS I LSDAWFHFVFFIQAVLVT LS VFLY 
LFAYKEYIACLVIAMALGWAmLYYTRGFQSMGMYSVMIQK^ 

FGVALASLIEKCPKDNKDCSSYGSFSDAVLELFKLTIGLGDLNIQQNSKYPILFLFLLIT 
YVILTFVLLLNMLIAIJ^GETVENVSKESERIVmi^RARTILEFEKMLPEWLRSRFRMGEL 
CKVAEDDFRLCLR I NEVKWTEX 
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Input file Fbh52920c-seq; Output File Fbh52920c . tra 
Sequence length 3042 

M V G G C R W T E D V E P A E V K E K M 
ATG GTT GGA GGA TGC AGG TGG ACA GAA GAC GTG GAG CCT GCA GAA GTA AAG GAA AAG ATG 

S FRAARL SMRHRRNDTI.DST 
TCC TTT CGG GCA GCC AGG CTC AGC ATG AGG AAC AGA AGG AAT GAC ACT CTG GAC AGC ACC 

R TLYS SASRSTDLSYSESDL 
CGG ACC CTG TAC TCC AGC GCG TCT CGG AGC ACA GAC TTG TCT TAC AGT GAA AGC GAC TTG 



QINQSEKWNYKKHTKEFPTD 
CAG ATC AAC CAA AGT GAG AAA TGG AAC TAC AAG AAA CAC ACC AAG GAA TTT CCT ACC GAC 

-A FGDI Q FETLGKK G KY I RLS 
GCC TTT GGG GAT ATT CAG TTT GAG ACA CTG GGG AAG AAA GGG AAG TAT ATA CGT CTG TCC 

CDTDAEILYELLTQHWHLKT 
TGC GAC ACG GAC GCG GAA ATC CTT TAC GAG CTG CTG ACC CAG CAC TGG CAC CTG AAA ACA 

P R M 



RKIFSRLIYIAQSKGAWILT 
CGC AAG ATC TTC AGC CGG CTC ATC TAC ATC GCG CAG TCC AAA GGT GCT TGG ATT CTC ACG 

G GTHYG LMKYLG EVV RDNTI 
GGA GGC ACC CAT TAT GGC CTG ATG AAG TAC CTC GGG GAG GTG GTG AGA GAT AAC ACC ATC 

SRSSEENIVAIGIAAWGMVS 
AGC AGG AGT TCA GAG GAG AAT ATT GTG GCC ATT GGC ATA GCA GCT TGG GGC ATG GTC TCC 

NRDTI* I RNCDAEG Y F LAQYE* 
AAC CGG GAC ACC CTC ATC AGG AAT TGC GAT GCT GAG GGC TAT TTT TTA GCC CAG TAC CTT 

MDDFTRDPLYILDNNHTHLL 
ATG GAT GAC TTC ACA AGA GAT CCA CTG TAT ATC CTG GAC AAC AAC CAC ACA CAT TTG CTG 

LVDNGCHGHPTVEAKLRNQL 
CTC GTG GAC AAT GGC TCT CAT GGA CAT CCC ACT GTC GAA GCA AAG CTC CGG AAT CAG CTA 

EKYISERTIQDSNYGGKIPI 
GAG AAG TAT ATC TCT GAG CGC ACT ATT CAA GAT TCC AAC TAT GGT GGC AAG ATC CCC ATT 

VCFAQGGGKETLKAINTS IK 
GTC TGT TTT GCC CAA GGA GGT GGA AAA GAG ACT TTG AAA GCC ATC AAT ACC TCC ATC AAA 

NKIPCVVV E GSGQIADVIAS 
AAT AAA ATT CCT TGT GTG GTG GTG GAA GGC TCG GGC CAG ATC GCT GAT GTG ATC GCT AGC 

L V E V E D A L T S S A V K E K U V R F 
CTG GTG GAG GTG GAG GAT GCC CTG ACA TCT TCT GCC GTC AAG GAG AAG CTG GTG CGC TTT 

L P R T V S R L P E E E T E S W I K W L, 
TTA CCC CGC ACG GTG TCC CGG CTG CCT GAG GAG GAG ACT GAG AGT TGG ATC AAA TGG CTC 

K E I L E'C S H If L T V I K M E E A G D. 
AAA GAA ATT CTC GAA TGT TCT CAC CTA TTA ACA GTT ATT AAA ATG GAA GAA GCT GGG GAT 



20 
60 

40 
120 

60 
180 



VNFIQAN FKKRECVFFTKDS -80 

GTG AAT TTT ATT CAA GCA AAT TTT AAG AAA CGA GAA TGT GTC TTC TTT ACC AAA GAT TCC 240 

K A T E N V C K C G Y A Q S Q H M E G T iOO 

AAG GCC ACG GAG AAT GTG TGC AAG TGT GGC TAT GCC CAG AGC CAG' CAC ATG GAA GGC ACC 300 
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360 
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480 
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PNLVISVTGGAKN.FALKP 
CCC AAC CTG GTC ATT TCT GTG ACC GGG GGC GCC AAG AAC TTC GCC CTG AAG CCG CGC ATG 540 
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E.I v SNA I. S Y A 
GAA ATT GTG AGC AAT GCC ATC TCC TAC GCT 

D. K D _N W N G Q L K 
GAC AAG GAT AAC TGG AAT GGG*CAG CTG AAG 

ANDEIFTN .DR 
GCC AAT GAT GAG ATT TTC ACC AAT GAC CGC 

D T I I Q V T W .L E 
GAC" ACA ATA ATC CAG GTC ACA TGG CTG GAA 

V T D G . K . A . ' S S . . H M 
GTG ACT GAC GGC AAA"' GCC TCT TCT" CAT ATG 

EVMFTAL IKD 
GAA GTC ATG TTT ACG GCT CTC ATA AAG GAC 

NGLNL.RKFLT 
AAT GGC TTG AAC CTA CGG AAG TTT CTC ACC 

HFSTLVYRNL. 
CAC TTC AGC ACG CTT GTG TAC CGG AAT CTG 

LLTFVWKLVA 
CTC CTC ACG TTT GTC TGG AAA CTG GTT GCG 

RNGRDEMD IE 
AGA AAT GGC CGG GAC GAG ATG GAC ATA GAA 

PLQAL F IWAI 
CCC CTG CAA GCT CTC TTC ATC TGG GCC ATT 

IWEQTRGCTL 
ATT TGG GAG CAG ACC AGG GGC TGC ACT CTG 

T L A K V K N D 'l N 
ACT CTG GCC AAA GTG AAG AAC GAC ATC AAT 

EYLTRAVGES 
GAG TAC CTG ACC CGG GCT GTT GGT GAG TCC 

L P C G T D I A S G 
CTG CCA TGT GGC ACA GAC ATT GCC AGC GGC 

TECYSSDEDL 
ACT GAG TGT TAC AGC AGC GAT GAA GAC TTG 

AWGGSNC LEL 
GCT TGG GGT GGA AGC AAC TGT CTG GAG CTG 

AQPGVQNFLS 
GCC CAG CCT GGG GTC CAG AAT TTT CTT TCT 

TKNWKIILCL, 
* ACC AAG AAC TGG AAG ATT ATC CTG TGT CTG 

V S F R .K .K P .V D K 
GTA TCA TTT AGG AAG AAA CCT GTC GAC AAG 

FFTSPFVVFS 
TTC TTC ACC TCC CCC TTC GTG GTC TTC TCC 

LLFAYVLLMD 



LYKAFSTSEQ 440 

CTA TAC AAA GCC TTC AGC ACC AGT GAG CAA 1320 

LLLEWNQLDL 460 

CTT CTG CTG GAG TGG AAC CAG CTG GAC TTA 1380 

RWEKSKPRLR 480 

CGA TGG GAG AAG AGC AAA CCG AGG CTC AGA 1440 

NGR I . K V E S K D 500 

AAT GGT AGA ATC AAG GTT GAG AGC AAA GAT 15.00 

LVVCKSADLQ 520 

CTG GTG GTT CTC AAG TCT GCT GAC CTT CAA 1560 

RPKFVRLFLE 540 

AGA CCC AAG TTT GTC CGC CTC TTT CTG GAG 1620 

HDVLTELFSN 560 

CAT GAT GTC CTC ACT GAA CTC TTC TCC AAC 1680 

QIAKNSYNDA 580 

CAG ATC GCC AAG AAT TCC TAT AAT GAT GCC 1740 

NFRRGFRKED 600 

AAC TTC CGA AGA GGC TTC CGG AAG GAA GAC 1800 

LHDVS PITRH 620 

CTC CAC GAC GTG TCT CCT ATT ACT CGG CAC 1860 

LQNKKELSKV 640 

CTT CAG AAT AAG AAG GAA CTC TCC AAA GTC 1920 

AALGASKLLK 660 

GCA GCC CTG GGA GCC AGC AAG CTT CTG AAG 1980 

AAGESEELAM 680 

GCT GCT GGG GAG TCC GAG GAG CTG GCT AAT 2040 

TVWNAVVGAD 700 

ACA GTG TGG AAT GCT GTG GTG GGC GCG GAT 2100 

THRPDGGELF 720 

ACT CAT AGA CCA GAT GGT GGA GAG CTG TTC 2160 

AEQLLVYSCE 740 

GCA GAA CAG CTG CTG GTC TAT TCC TGT GAA 2220 

AVEATDQHFI 760 

GCG GTG GAG GCC ACA GAC CAG CAT TTC ATC 22 80 

KQWYGEISRD 780 

AAG CAA TGG TAT GGA GAG ATT TCC CGA GAC 2340 

F • I I PL.VGCGF 800 

TTT ATT ATA CCC TTG GTG GGC. TGT GGC TTT . 2400 : 

H K K L. UWYYVA 820 
CAC AAG AAG CTG CTT TGG TAC TAT GTG GCG 2460 

WNVVFY I A F Xj 840 
TGG AAT GTG GTC TTC TAC ATC GCC TTC CTC 2520 

FHSVPHPPEL 860 
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CTG CTG TTT GCC TAC GTG CTG CTC ATG GAT TTC 

V L Y S L V F V L F C 
GTC CTG TAC TCG CTG GTC TTT GTC CTC TTC TGT 

A P S A G P A'K P T P 
GCT CCC AGT GCG GGG CCC GCC AAG CCC ACG CCC 

STRSPGS 'RSRH 
TCC ACA CGC AGC CCC GGT TCC CGC TCA CGC CAC 

G A S S G L G Q P R K 
GGT GCC AGC TCT GGC CTT GGC CAG CCC AGA AAG 

K G S S S A " A K V G A 
AAG" GGC TCC TCA AGT GCC GCC AAA GTG GGA GCC 

EGCEDCQ HAVT 
GAG GGC TGT GAG GAC TGC CAG CAC GCT GTC ACC 

QTDEDLFPYGA 
CAA ACA GAC GAA GAT CTC TTC CCC TAT GGA GCA 

SFRGEEMSIGK 
AGC TTT CGA GGA GAG GAG ATG AGC ATC GGC AAG 



CAT TCG GTG CCA CAC CCC CCC GAG CTG 2580 

DEVRQGRPA 880 

GAT GAA GTG AGA CAG GGC CGG CCG GCT 2640 

TRNSIWPAS 900 

ACC CGG AAC TCC ATC TGG CCC GCA AGC 2700 

SFHTSLQAE 920 

TCC TTC CAC ACT TCC CTG CAA GCT GAG 2760 

G L PQ'CSGG L 940 

GGG CTC CCA CAG TGC AGC GGT GGG CTG 2820 

QAEEVPRAS 960 

CAG GCA GAG GAG GTG CCG AGA GCA AGC 2880 

SQKRKTAMD 980 

TCT CAG AAG CGT AAG ACA GCA ATG GAC 2940 

FYQF L M I SR... 1000 

TTC TAC CAG TTC CTG ATG ATT TCC AGG 3000 

Q H * -1014 

CAG CAC TAA 3042 
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F'fflHkifa Mr ^ Ma f.*l>-tc -Tyrejn.tr on? 




I ' ' ' I i ' I • I • I ■ ' ' i • I ' . I : ■ • . ■ i • • ' ■ , ■ i ■ i ' . i i « j . , . . . , . , . , . . . i . , . , . . 

I 41 81 121 lf,l ?»! ?.:\ .*U»I Ifil <1«1 441 .181 5?1 5R1 fi'H nil R8I 7?1 



>48000FL 

MKi^PKEKVPLMGKRVAAPSGNPAVIjPEKRPAEITPTKKSAHFFLEIEGFEPNPTVAKTS 
PPVFSKPMDSNIRQCISGNCDDMDSPQSPQDDVTETPSNPNSPSAQLAKEEQRRKKRRLK 
KRIFAAVSEGCVEEIiVELLVELQELCRRRHDEDVPDFI^K^ 

PNTKEIVRILLAFAEENDILGRFINAEYTEEAYEGQTALNIAIE.RRQGDIAALLIAAGAD 
VNAHAKGAFFNPKTQHEGFYFGETPLALAACTNQPEIVQLLMEHEQTDITSRDSRGNNIL 
HALVTVAED FKTQND FVKRMYDM I LLRSGNWELETTRNNDGLTPLQIxAAKMGKAE I LKYI 
LSREIKEKRLRSLSRKFTDWAYGPVSSSLYDLTNVDTTTDNSVLEITVYNTNIDNRHEML 
TliEPLHTLLHMKWKKFAKHMFFLSFCFYFFYNITLTLVSYYRPREEEAIPHPLAIjTHKMG 
WLQLLGRMFVLIWAMCISVKEGIAIFLLRPSDLQSILSDAWFHFVFFIQAVLVILSVFLY 
LFAYKEYLACLVLAMALGWANMLYYTRGFQSMGM^ 

FGVALASLIEKCPKDNKDCSSYGSFSDAVLELFKLTIGLGDLNIQQNSKYPILFLFLLIT 

WILTFVLLLlSnVILIAI^^ 

CKVAEDDFRLCLRINEVKWTEX 
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Protein Family / Domain Matches, HMMer version 2 



Searching for complete domains in PFAM 

hmmpfam - search a single seq against HMM database 

HMMER 2.1.1 (Dec 1998) 

Copyright (C) 1992-1998 Washington University School of Medicine 
HMMER is freely distributed under the GNU General Public License (GPL) 



HMM .file: 
Sequence file: 



/prod/ddm/seqanal/PFAM/pfam6 .2/Pfam 

/ prod/ ddm/ wspace/ orf anal/oa-script . 28060 . seq 



Query : 



48000FL 



Scores for sequence family classification (score includes all domains) 



Model 
value 



Description 



N 



ank 
19 4 
ion_trans 
06 1 

Peptidase_M2 9 
4 1 

Na_Galacto_symp Sodium :galactoside symporter family 
8.4 l 



Ank repeat 

Ion transport protein 
Thermophilic metalloprotease (M29) 



Score 

79.0 
34.5 
0.9 
-153 .9 



le- 

2 .5e- 



Parsed for domains : 



Model 


Domain 


seq-f 


seq-t 


hmm-f 


hmm-t 




score 


E 


-value 


ank 


1/4 


167 


202 . 


1 


33 


[] 


1.6 


1 


.4e+02 


ank 


2/4 


214 


246 . 


1 


33 


□ 


30.6 


3 


. 5e-05 


ank 


3/4 


261 


294 . 


1 


33 


[3 


27 . 9 


0 


. 00024 


Pep t i da s e_M2 9 


1/1 


325 


331 . 


402 


408 




0. 9 




4 


ank 


4/4 


340 


372 . 


1 


33 


[] 


18 . 6 




0.15 


ion_ trans 


1/1 


510 


677 . 


1 


223 


[J 


34. 5 


2 


. 5e-06 


Na_Galacto_jsymp 


1/1 


435 


699 . 


1 


285 


[] 


-153 . 9 




8 .4 



Alignments of top-scoring domains: 

ank: domain 1 of 4 , from 167 to 202: score 1.6, E = 1.4e+02 

*->dGrTPLHlAarnG. . . hlewklLLeaGAdvnardk<- 
G T+L+ A n +++ e+v++LL + + + 

48000FL 167 TGKTCLMKALLNInpnTKEIVRILLAFAEENDILGR 



202 



ank: domain 2 of 4 , from 214 to 246: score 30.6, E = 3.5e-05 

* - >dGrTPLHlAarnGhlewklLLeaGAdvnardk< - * 

+G+T+L +A+++ + + + + 1L+ aGAdvna k 
48000FL 214 EGQT ALNI A I ERRQGD I AALL I AAG AD VN AHAK 246 

ank: domain 3 of 4 , from 261 to 294: score 27.9, E = 0.00024 

* - >dGrTPLHlAarnGhlewklLLe . aGAdvnardk< - * 

G+TPL lAa+ +++e+v+lL+e+ * d+ rd 
4.8000FL 261 FGETPLALAACTNQPEIVQLLMEhEQTDITSRDS 294 
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Peptidase_M2 9 : 



48000FL 



domain 1 of 1, from 325 to 331: 
*->ifRrGnW<-* 
++R+GnW 
32 5 LLRSGNW 3 31 



score 0.9, E 



ank: domain 4 of 4 , from 340 to 372: score 18.6, E = 0.15 

* - >dGrTPLHlAarnGhlewklLLeaGAdvnardk< - * 
dG TPL lAa++G e++k++L++ 
48000FL 34 0 DGLTPLQLAAKMGKAEILKYILSREIKEKRLRS 3 72 

ion_trans: domain l of 1, . from 510 to 677: score 34.5, E = 2.5e-06 

*->ilf ildllfvllflleivlkf iayglkstsniaakylksifnildll 
.+ + +l++++f ++++++ + +1 +1+++ + +1 

48000FL 510 PSDhQS IliSDAWFHFVFFIQAVLVIL SVFLYLFAY- KEYL 

548 



ailplllllvlf lsgteqvakkrlrerfslelsqwyyrilrf IrlLrllR 
a+1+1 ++1 +++ 
549 ACLVL- -AMALGWAN 



48000FL 



561 



48000FL 



600 



lLrllrllrrletlfefelgtlaWslqslg.ralksilrfllllllllig 
+1 + + r + + + +g + + ++++++1 ++l + f i ++ +-f+ + l+g 
562 MLYYTRGFQS MGMY SVMIQkVILHDVLKFLFVYIVFLLG 



fsvigyllfkgyedlse.nevdgnsefssyfdafyflfvtlttvGfGdlv 
f+v+++ 1 + ++ + ++ + s++ s++da + If + ++G ++ 

48000FL 601 FGVALAS LIE KCpKDNKDCSSYGSFSDAVLELFKLTIGLGDLNIQ 

645 

pv. . . . wlgiif fvlf f iivgllllnlliavi<-* 
++++ ++1 +++++++i+ +llln+lia++ 
48000FL 646 QNskypILFLFLLITYVILTFVLLLNMLIALM 677 

Na_Galacto_symp: domain 1 of 1, from 435 to 699: score -153.9, E = 8.4 

*->qlGyf f falVLslagwllwiCf fgt . . . kEvySssdtrengqktts 
++ ++f 1 ++ ++ +++ + +++E + + + + t 

.4 8 0 0 0 FL 4 3 5 KFAKHMFFLS FCFYFFYNITLTLVS Yy rpREEE AI PH PLALTH 

477 

HqslkllakNdQLliLclaalfyllainilggaqlYYvtYvLGdpelFs 
++ 1+11++ . L + + ++ai+ 1 +1 .+ +L d+ 
4 78 KMGWLQLLGRMFVLIWAMCISVKEGIAIFLLRPSDL QS I LSDAWFHF 



48000FL 
524 
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ylllynilvglig.slLf . . PrLvkrf gkktvFagcivlm . . .vlgslli 
++ + + lv+1 + Lf + ++++ + ++++ 1+ +++ s ++ 

48000FL 525 VFFIQAVljVILSVf LYLFayKEYLACLV^ 

Ffvagsslalilvlif la. . . gilqqlvtllvWvlQVIMvsDtVDYGEwk 
++v+ + ++1 vl fl+ +1++++ +1 ++ e + 

48000FL 575 YSVMIQKVILHDVLKFLFvyiVFLLGFGVALASLI EKC 

612 

tGvRlEGlvySvflfvlKlGlAlsGalvGwiLgyiGY.vanasqststal 
+ ++ y + +s a+ ++ +iG ++ n+ q + + 

48000FL 613 PKDNKDCSSY GSFSDAVLELFKLTIGLgDLNIQQNSKYPI 

6 52 



gQlvfilalFalPpallllaafimlrfYkLtekklaeIveeLek.Wr.tr 
f ++1++++ ++++1++ ++ l +++++ ++ e e + Wr +r 

48 000FL 6 53 L FLFLLITYVILTFVLLL NMLIALMGETVENVSKESERiWRlQR 

696 



krk<-* 
+r 

48000FL 697 ART 699 
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Transmembrane Segments Predicted by MEMSAT 
Start 

End 

Orient 

Score 

440 

461 

ins-->out 

4 . 0 

488 

508 - 

out-->ins 

1.7 

520 

S40 . 

ins- - >out 

4.3 

547 

565 

out- -> ins 

3.7 

590 

609 

ins- - >out 

5.0 

652 

676 

out- - >ins 

6.9 



>48000FL 

MKAHPKEMVPLMGKRVAAPSGNPAVLPEKRP AE ITPTKKS AHFFLE I EGFEPNPTVAKTS 

PPVFSKPMDSNI RQC I SGNCDDMDS PQSPQDDVTETPSNPNS PS AQLAKEEQRRKKRRLK 

KRIFAAVSEGCVEELVEIiLVELQELCRRRHDEDVPDFLMHKLTASDTGKTCLMK^ 

PNTKEIVRILLAFAEENDILGRFINAEYTEEAYEGQTALNIAIERRQGDIAALLIAAGAD 

VNAHAKGAFFNPKYQHEGFYFGETPLAIAACTNQPEIVQLLMEHEQTDITSRDSRGNNIL 

HALVTVAEDFKTQNDFVKRMYDMILLRSGNWELETTRNND 

LS RE I KE KRLRS LS RK FTDWAYG P VS S S L YDLTNVDTTTDN S VLE I TVYNTN I DNRHEML 
TLEPLHTLLHMKWKiCFAKHMFFLSFCFYFFYNITLTLVSYYRPREEEAIPHPIALTHKM 
WLQLLGRMFVLIWAMCISVKEGIAIFLLRPSDLQSILSDAWFHFVFFIQAVLVILSVFLY 
LFAYKEYIACLVLAMALGWANMI/TC^ 

FGVALASLIEKCPKDNKDCSSYGSFSDAVLELFKLTIGLGDLNIQQNSKYPILFLFIiLIT 

WILTFVI^Ii^LIALMGETVElW^ 

CKVAEDDFRLCLRINEVKWTEX 
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BacktoorfanaLcgi 

Analysis of Fbh52920c (1013 aa) 



ifj rttl hits . "rifis_r*cep pyr KfcatrfeCwp 



1 -2- 




5.4 5.< 



1 1 1 1 1 1 ! 1 1 I M M 1 1 " I M 1 ! M 1 1 M M 1 1 M M M M 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 ! 1 : 1 ! 1 1 1 1 1 1 1 : s | 1 1 ! * 1 1 1 1 ■ 1 1 : 1 f 1 1" I 1 " " I I M 1 . ! i 
1 41 81 121 161 2*1 241 281 321 3S1 401 441 481 521 561 601 641 681 721 761 801 841 B81 921 361 1001 



>Fbh52920c 

MVGGCRWTEDVEPAEVKEKMSFRAARLSMRNRRNDTLDSTRTLYSSASRSTDLSYSESDL 
VNFXQANFKKJIECVFFTKDSKATENVCK^ 

AFGDIQFETLGKKGKYI RLSCDTDAEILYELLTQHWHliKTPNIA/T SVTGGAKNFALKPRM 
RKIFSRLIYIAQSKGAWILTGGTHYGLMKYLGEVVRDNTISRSSEENIVAIGIAAWGMVS 
NRDTLIRNCDAEGYFIuAQYI^lDDFT 

EKYI S ERTIQDSNYGGKI P IVCFAQGGGKETIiKAINTS I KNKI PCWVEGSGQ I ADVI AS 

LVEVEDALTSSAVKEKLVRFLPRWSRLPEEErrESWIKWLKEILECSHLDW 

EI VSNAI S YALYTCAFSTS EQDKDNWNGQLKXjLiLEWNQLDIiANDE I FTNDRRWEKSKPRLR 

DTI IQVTWLENGRIKVESKDVTIX?KASSHMLVVLKSADr^ 

NGLNI^UCFLTHDVLTELFSNHFSTLVYRNI^IAK^ 

IWGRDEMDIELHDVSPITRHPIiQALFIOT^ 

TLAKVKirorNAAGESEELi7U^EYT^TRAVGESTVWNAVV 

TECYS SDEDLAEQLLVYSC EAWGGSNCLELAVEATDQHF I AQPGV QNFLSKQWYGEI SRD 
TKNWKI ILCLFI I PLVGCGFVSFRKKPVDKHKKLLWYYVAFFTS PFWFSWNWFYIAFL 
LLFAYVLL^FHSVPHPPELVLYSLVFV^ 

STRSPGSRSRHSFHTSLQAEGASSGIiGQPRKGDPQCSGGLKGSSSAAKVGAQAEEVPRAS 
EGCEDCQHAVTSQKRKTAMDQTDEDLFPYGAFYQFIjMI SRSFRGEEMS IGKQH 
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Protein Family / Domain Matches, HMMer version 2 

Searching for complete domains in PFAM 

hmmpfam - search a single sea; against HMM database 

HMMER 2.1.1 (Dec 1998) 

Copyright (C) 1992-1998 Washington University School of Medicine 

HMMER is freely distributed under the GNU General Public License (GPL) . 

HMM file: /prod/ddm/seqanal/PFAM/pfam5.3/Pfam 
Sequence file: /prod/ddm/wspace/orf anal/oa-script - 18733 . seq 

Query: Fbh52920c 

Scores for sequence family classification (score includes all domains) : 
Model Description Score E-value N 



Trans_recep Transient receptor 27.9 6.1e-08 2 

pyridoxal_deC Pyridoxal- dependent decarboxylase conse 3.9 2.8 1 

Parsed for domains: 

Model Domain seq-f seq-t hmm-f hmm-t score E-value 



Trans_recep 1/2 720 778 . . 30 89 . . 21.7 4.5e-06 

pyridoxal_deC 1/1 791 806 380 398 . J 3.9 2.8 

Trans_recep 2/2 820 876 131 212 1.5 5.7 

Alignments of top-scoring domains: 

Trans^recop: domain 1 of 2, from 720 to 778: score 21. 7, E = 4.5e-06 

*->LEvII^deessqkLlergslarganI^sRLKLAIkYkqKkFVAHpNc 
+++++ de+ +++L1 +a+g +L LA ++ + F A+p++ 
Fbh52920c 720 FTECYSSDEDLAEQLLVYSCEAWG-GSNCLELAVEATDQHFIAQPGV 765 

QQlLasiWYdgLs<-* 
Q WY++ s 

Fbh52920c 766 QNFLSKQWYGEIS 778 

pyridoacal_deC : domain 1 of 1, from 791 to 806: score 3.9, E = 2.8 

* ->F<5i vadpernlglVcFr 1K< - * 
F+i+ +++g V+Fr+K 
Fbh52920c 791 FIIPL VGCGFVS FRKK 806 

Trans_recep : domain 2 of 2, from 820 to 876: score 1.5, E = 5.7 

*->kf lRkPFmKFilHaaSYltFLglLLilaSqrfegif f llfaf trmkt 
+f+++PF++F + Y+ + FL 1+++++ +f 
Fbh5 2920c 82 0 AFFTS PFWFSWNWFY I AFLLLFAYVLLMDFH 852 

elaeqelrerGpvpswvEHIilWViGlIWeEiKq<-* 
+vp + El +++ V+ 1 + E++q 
Fbh52920c 853 SVPHPPELVLYSLVFVLFCDEVRQ 876 
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Transmembrane Segments Predicted by MEMS AT 
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>Fbh52920c 

MVGGCRWTED\rePAEVKEKMSFRA^^ 

VNFIQANFKKRECVTFTKDSKATEWCKCGYAQSQHMEGTQINQSEKWbTf 

AFGDIQFETIXSKKGKYIRLSCDTDAEILYELLTQHWHLKTPNLVISVTGGAK^ 

RKI FS RL» I YI AQS KGAWILTGGTHYGLMK YLGEVVRDNTI S RS S EENTVAIG I AAWGMVS 

NRDTL I RNC DAEG YFLAQYLMDD FTRDPL Y I LDNNHTHLLLVDNGCHGH PTVEAKLRNQL 

EKYISERTIQDSNYGGKIPIVCFAQGGGKETLKAINTSIKKKIPCVVVEGSGOIADVIAS 

LVEVEDALTSSAVKEKLVRFLPRTVS RLPEEETESWIKWLKEILEC SHLI/TVTKMEEAGD 

EIVSNAISYALYKAI^STSEQDKDNWNGQLKLI^ 

DTI IQVTWIiENGRI KV^SKDVTIX5KASSHMLVVIiKSADIiQEVMFTALIKDRPKFVRLFLE 

RNGRDEMDI ELHDVS P ITRH PLQALF IWAI LQNKKELS KVI WEQTRGCTIiAALGAS KLI»K 
TLAJCVI^DIblAAGESEELANEYIiTRAVGESTVWNAVVGADLP 

STRSPGSRSRHSFHTSI^AEGASSGIX^PRKGIiPQCSGGLKGSSSA/OCV^GAQAEEVPRAS 
EGCEDCQHAVTSQKRKTAMDQTDEDIjF P YGAFY QFLMI SRSFRGEEMS IGKQH 
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Fbh52920c 
hTRP7 



MVGGCRWTEBVEPAEVKEKMSFRJU^ 

MEPS ALRKAGS EQEEGFEGLPRRVTDLGMVSNLR - RSNS SLFKSWRIjQCPFGNNDKQES - 
* ***:.* 



* * * * 



Fbh52920c 
hTRP7 



Fbh52920c 
11TRP7 



Fbh5292 0c 
hTRP7 



Fbh52920c 
hTRP7 



Fbh52920c 
HTRP7 



Fbh52920c 
HTRP7 



Fbh52920c 
hTRP7 



Fbh52920c 
|iTRP7 



Fbh52920c 
hTRP7 



Fbh52920c 
fiTRP7 



fbh52920c 
tTRP7 



Fbh52920c 
hTRP7 



LVNF I QANFKKRECVFFTKDSKATEN VCKCGYAQSQHMEGT- - -QINQSEKWNYKKH 

LS SWI PEN I KKKECVYFVESSKIjSDAGKVVCQCGYTHEQHLEEATKPHTFQGTQWDPKKH 

* >: * * : ** : ***:*.:. ** :: ** : * ** ' : . * * «' * ' * *• : * : *** 

TKEFPTDAFGD IQFETLGKK- GKY I RLSCDTDAE I LYELLTQHWHLKTPNLVI SVTGGAK 
VQEMPTDAFGD I VFTGLSQKVKKYVRVSQDTPS SVI YHLfMTQHWGLDVPNIiLI S VTGGAK 
* *.:* **:*:* ** :.::*_*:**** 

NFALKPRMRKI FS R - L I Y I AQS KGAW I LTGGTHYGLM KYLGE WRDNT I SR S S EEN - I VA 
NFNMKPRLKSIFRRGLVKVAQTTGAWIITGGSHTGVMKQVGEAVRDFSLSSSYKEGELIT 

* * .***..** * * : : ** ;- **** : *** : * * : * * .★*,*** : : * * : * . ::: 

I G I AAWGMVSNRDTL I RNCDAEGY FLAQYLMDDFTRDPL.Y I LDNNHTHLLLVDNGCHGHP 

IGVATWGTVHRREGLIH PTGS F PAE Y I LDEDGQGNLTCLDSNHSHF I LVDDGTHGQY 

**.*-*★ * ** : . * * *:*::*: * * * . * * : * : : * * * : * **: 

TVEAKLRNQLEKYI SERTIQDSNYGGKI PI VCFAQGGGKETLKAINTSIKKKIPCVWEG 
GVE I PLRTRLE KF I S EQTKERGGVA I KI PI VCWIiEGGPGTLHTIDNATTNGTPCVWEG 
**.***.**★.* . mm m ******.. ** **::*:.: .* ******* 

SGQIADVIASLV- -EVEDAIjTSSAVKEKLVRFIjPRTVSRIjPEEETESWIKWIjKEIIjECSH 
SGRVADVTAQVANLPVSD - 1 TI SLIQQKLSVFFQEMFETFTESRIVEWTKKIQDI VRRRQ 

* * . - * m * : * * ; : : ** *: . . * * :::*:. 

LIjTVI KMEEAGDE I VSNA I S YALYKA - FS TS EQDKDIvT^GQLKIjLIjE WNQLDL.ANDE I FT 
LLTVFREGKDGQQDVDVAI I^AIiLKASRSQDHFGHENVHDHQLKIiAVAPflNRVDI ARSE I FM 
****-- . *- ; *, ** ** ** * **** : **::*:*..*** 

NDRRWEKS- -KPRLRDTI IQ VTWUSNGRI KVESKDVTDGKASSHMLiVVIjKSADIj 

DEWQWKPSDIiHPTMTAALISNKPEFVKLFLENG VQLKEFVTWDTLLYLYENLDPSCL 

...*.* .*- -•* : **** *:*:.. - : :: * . . : * 

QE^mFT-ALIKDRPKFVRLFIiENGIiNLRKFLTHDVbTEL 

FHSKLfQKVIiVED - PE - RPACAPAAPRIjQMHHVAQVLRELLG - DFTQPLYP - -RPRHNDR- 
: .*::* * : . . * : , . :** **:. :* : :*. 

DALLTFVWKLVANFRRGFRKEDRNGRDEMD I ELHDVS P I TRHPLQALF I WAI LQNKKEDS 
IiRXiLiIj PVPHVKLNVQ - GVS - - LRS - - LYKRS SGH - VT - FTMD P I RDLL I WAI VQNRRELA 
** * . - * m . *_ *. . * *: :* .*:: *:****:**::**: 

KVI WEQTRGCTLAALGAS KXiLKTIJVKVKND INAAGESEELANEYLTRAVGES TVWNAWG 

GI I WAQSQDC I AAALACS KI LKELSKEEEDTD S SEEMLALAEEYEHRAIG V- 

.** * ::- * *** % ^ ** : ** * : * : : * ::: * **.** ** : * * 

ADLPCGTDIASGTHRPIX^ELFTECYSSDEDIiAE 

FTECYRKDEERAQKLI/TRVSE^WGKTTCI^ 

***** .**: *::**. .**** :.**:**:**.* : 



Fbh52920c 
KTRP7 



Pbh52920c 
hTRP7 



FIAQPGVQNFLSKQWYGEI SRDTKSlWKI IIiCLFI IPLVGCGFVSSRKKPVDJ 

_ - -5|jy2KRijQI 



DKHKKLLWYY 

raSH^IO^LTIO^ SF^lEKRIiQDVGTPAARA 
* : . : * : * ** : * * : *: : * *. *•• ** : : :**: * : ;*** z * :r. 



***♦.*.**♦ * :: *:*** ******::**:.★* * . : * :* *.*:*:** 
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Fbh52920c 
hTRP7- 



Fbh52920c 
hTRP7 



Fbh52920c 
hTRP7 



Fbh52920c 
hTRP7 



Fbh52 92 0c 
hTRP7 



Fbh52920c 
hTRP7 



Fbh52 92 0c 
hTRP7 



Fbh52920c 
hTRP7 



Fbh5292 0c 
WTRP7 



Fbh52920c 
tlTRP7 



Fbh52920c 
[1TRP7 



WO 02/00722 PCT/US01/20640 

"F -S AAP — • SfiQr z KEfPSgR'* 

YDPDEOTLiTOCAALYFSDFWNKLDVGAI^ 

*-.-*** ** ** # * * 

— NSIWPASSTRSP--GSRSRH SF HTSLQAEGASS 

. RLMH I FT IS KT.LG PKTI I VKRMMKDVFFFLFL1I1AVWVVS FGVAKQAI L IHNERRVDWLFR 

*.*★* * ** * . . 

• • . » • ..... 

G — - -LGQ-P — RKGL PQ-CS-GG LKGS S S AAKVGAQAE E V PR ASE 

GAVYHS YLTI FGQ-I PGYI DGVNFNPEHCS PNGTDP YKPKCPESDATQQRPAFPE WLTVliL 
★ -★■**. * - *.**_* *...:.. * . * 

GCEDCQHA VTS QKR KTAMDQT DEDLFP - Y-G AFYQFLMI S 

LCLYLLFTNILIjLNLLIAMFNYTFQQVQEHTDQIWKFQRHDLIEEYHGRPAAPPPFILLS 

- -RSF- -RGEEMS igkqh 

HLQLF I KRVVLKTPAKRHKQLKNKLEKNEEAALLSWEI YLKENYLQNRQFQQKQRPEQK I 

EDISNKVDAMVDLLDLDPLKRSGSMEQRLASL 

VPTLASQKAAEEPDAEPGGRKKTEEPGDSYHWARHLLYPNCPVTRFPWNEKVPWE^ 
LI YDPPFYTAERKDAAAMDPMGDTLEPLSTIQYNVVIX5LRDRRS FHGPYTVQAGLPLNPM 
GRTGLRGRGS LSCFGPNHTL Y PMVTRWRRNEDGAI CRKS I KKMLE VLWKLPLS EHWAL P 
GGSREPGEMLPRKIiKRI LRQEHWPS FENLLKCGMEVYKGYMDDPRNTONAWI ETVAVSVH 
FQDQNDVELNRIjNSNIjHACDSGAS I RWQWDRRI PL YANHKTLLQKAAAE FGAHY 



FIGURE 10 



16/23 



BNSDOCID: <WO 0200722A2_I_> 



WO 02/00722 

jiajstal w (1.74) multiple sequence alignment 



PCT/US01/20640 



Fbh52920c 
hMELASTATIN 



^ IRVSYDTKPDSLLHLMVKDWQLELPKLLI SVHGGLQNFEMQPKLKQVFGKGLI KAAMT 1 



Fbh52920c 
flMELAS TAT I N 



- -MVG G--CR W TEDVEPAEVK F 

TG AW X FTGGVS TG VI SHVGDALKDHS S KS RGRVCAI G I A P WG I VENKEDLVGKD VTRVYC 



Fbh52920c 
hMELASTATIN 



KMSFRAARIiSMRN RRNDTLDST RTLYSSAS -RSTDLSYSE - SDLV 

TMSNPLS KXi SVLNNS HTH FI liADNGTLGKYGAEVKLRRLLEKH I S LQKI NTRLGQGVPLV 



Fbh52920c 
FiMELAS TAT I N 



NFIQA NFKKRECVFFTKDS KATENVCK- CGYAQS QHM EGTQINQS E - KWN 

GLWEGGPNWS I VLE YLQEEP PI PWI CDGSGRASD I LS FAHKYCEEGG 1 1 NES LREQI 
* .... . * * * * ***•*.*. 



Fbh52920c 
hMELASTATIN 



YKKHTKEFPTDAFGDIQ - FETL.G KKGKY I RLSCDTDAE I LYELLTQHWH LK 

LVT I QKTFNYNKAQSHQLFA 1 1 MECMKKKELVTVFRMGS EGQQDI EMAI LTALLKGTNVS 



Fbh5292 0c 
hMELASTATIN 



TPN-LVI SVTGG AKNFALKPRMRKI FSRLIYIAQS KG AW IL.T- GGTH 

APDQLSIJU^WNRVDIARSQIFVFGP-HWTPIXSSI^ 



Fbh52920c 
hMELASTATIN 



YGLMKYLGEV VRDNTISRSSEE-NIV- AIGIAAWG-MVSNR-D TLIRN-CDAEG 

KGKGKKKGKVKEEVEEETDPRKI ELLNWVNALEQAMIJDALVLDRVDFVKXiLI ENGVNMQH 



Fbh52920c 
hMELASTATIN 



YFLAQ YLMDDF - TRD - P LYILDN NHT HLLLVDNGCHGHPTVEAKLRNQL 

FLTI PRLEELYNTRLGPPNTLHLLVRDVKKSNLPPDYHI S L I D I GL VLE YLMGGAYRCNY 



Fbh52920c 
hMELASTATIN 



EKYI SERT IQDSNYGGKI P I VCFAQGG GKE TLKAINTS I KN 

TRKN - FRTL YNNLFGPKRPKALKLLGMEDDEPPAKGKKKKKKKKEEE I D I DVDDPAVSR F 
★ * - - * * ★ * ** - :*•.:.: 



Fbh52920c 
hMELASTATIN 



KIP- -CWVEGSGQIADVIASLVEV- - EDALTSSAVKEKLVRFLPR TVSRL 

QYPFHELMVWAVLMia^QKMAVFLWQRGEESMAKALVACKLYK^^ SQD 
* • * - - * - - » « - ■* ** ... : * * 



Fbh52920c 
[iMELASTATIN 



PEEETESWIKWLKEILECS--HLLTVIKMEEAGD-EIVSNAISYAL YKAFS--TS 

LDNNSKDFGQLALELLDQS YKHDEQI AMKLLTYELKNWSNSTCLKLAVAAKHRDF IAHTC 
........ * - * - ★ * - - - * * . * : : * * 



Pbh52920c 
hMELASTATIN 



EQ- -DKDNWNGQLK LLLEWNQLD LAN-D- E 

SQMLLTDMWMGRLRMRiaroGLKVIMGILLPPTILFL 
* * * * - * - * - * * - * * * * 



Fbh52920c 
[iMELASTATIN 



IFT--N-D RRW - EKSKPRLRDT I IQVTWLENG - RI KVES KDVT - DGKAS SHMLV 

ENTDANADAG SRKGDE ENEHKKQRS I PIGTKICEFYNAPI VKFWFYTI S YLGYLLLFNYV 
★ ★ * *. * . . ♦ • • * - - i * r * . : = * * 



?bh52920c 
llMELASTATIN 



VLKSAD LQEVMFTALI KDRP - KFVRLFLEN - - G - LN - - LRKFLTHDV - LTELFSN - 

ILVRMDGWPSLQEWI VI SYIVSLALEKIREILMSEPGKLSQKI KVWLQE YWNI TDLVAI S 



Fbh52920c 
hMELASTATIN 



HFS T - LVYRNLQI AK - NS YNDALL TFVWKLVANFRRGFRKED RNGRDEMDIE 

TFM I GAI LR - LQNQP YMGYGRVI YCVD 1 1 FWY I RVLD I FGTVNKYLGPYVMMIGKMMIDML 
* • * ** _ * . . . * : . * * *: s*2 
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Pbh52920c 
hMELASTATIN 



LHDVS PITR HP LQALF- --IWAIL Q N 

Y FWT MLWLMS FGVARQAI LH PEE KPS WKLARN I FYMP YWM I YGEVFADQ I DLYAME I N 



Fbh52920c 
hMELAS TATIN 



K KB LSKVT W- -EQTRGC TLAALGAS KLLKTLAKVKNDIN 

PPCGENLYDEEGKRLPPCIPGAWLTPADMACYLLVANILLVNLLIAVF^^ 



Fbh52920c 
hMELASTATIN 



AAGESE - -ELANEYL.TR AVGESTVWNAVVGADLPCGTDIASGTHRPD- GGELFT 

QVWKFQRYQLIMTFHDRPVLPPPMI ILSHI YI I TMRLSGRCRKKREGDQEERDRGLKLFL 
. :: : * : * : *:: :: * . . .__.**•** 



Fbh52920c 
hMELAS TAT I N 



ECYSS DEDLA EQLLVYSCEAWGGSNCLELAVEATDQHF 

SDEELIO^LHEFEEQCVQEHFREKEDEQQSSSDERIRVTSERVENMSMRLEEINER----ETF 
- * * * * . . * * *** * . * 



Fbh52920c 
hMELASTATIN 



I AQPGVQNF - LSKQWYGEI S-RDT KNWKI I LCLFI I PLVG CG- FVSFRKKPV 

MKTS - LQTVDLRLAQLE ELSNRMVNALENLAG I DRS DL 1 QARS RAS S E CEATYLLRQS S I 
: - * * * . ■* * . *■ * . * -* 



Fbh52920c 
HMELASTATIN 



D KH- - - KKXjLVTYYVAF FT S P FV - VFS WNWFY I A FLLiLFAYVLLMDF 

NSiUXSYSLYRYHFNGEELLFEDTSLSTSPGTGVRKKTCSFRIKEEKDVKTHLVPECQNSL 



Fbh52920c 
hMELASTATIN 



H SVP H P PELVLY S LVFVLF CDE VRQGR PAAPSA 

HLSLGTSTSATPDGSHLAVDDLKNAEESKLGPDIGISKEDDERQTDSKKEETISPSLNKT 



Fbh52920c 
hMELASTATIN 



GPAK PTPTRN SI-WPASS TR-SPGS RSRHSFHTS 

DVIHGQDKSDVQNTQLTVETTNI EGTI SYPLEETKI TR YFPDETINACKTMKSRSFVYSR 
* * * •* - * - * ** * . ** 



Fbh52920c 
hMELASTATIN 



LQ- -AEGA SSGLGQPRKGLPQCS - -GGLKGSSS AAKVGAQAEEVPRASE 

GRKLVGGVNQDVEYS S I TDQQLTTE WQCQVQKITRSHS TD I PY I VSEAAVQAEQKEQFAD 



Pbh5292 0c 
hMELASTATIN 



GCEDCQHAVTSQKRK TAMDQT-DBDLFPYG- -AFYQF-LMISRSFRGEE MSI 

-MQDEHHVAEAI PRI PRLSLTI TDRNGMENLLS VKPDQTLG FPSLRS KSLHGHPRNVKS I 



Fbh52920c 
HMELASTATIN 



-GKQ H 

QGKLDRSGHASSVSSLVIVSGMTAEEKKVKKEKASTETEC 
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1 

SEQUENCE LISTING 

<110> Millennium Pharmaceuticals, Inc., et al. 

<120> 4 8000 AND 52920, NOVEL HUMAN CALCIUM 
CHANNELS AND USES THEREOF 

<130> MNI-170PC 

<150> US 60/214, 176 
<151> 2000-06-26 

<160> 6 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 4586 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS , 

<222> (146) . . . (2368) 

<400> 1 

ccacgcgtcc gattccagcc atccctctgc ctgcaatgag agcttcccgc cgcctcagcc 60 
acagtcccac ccgggggcct tgggccccag acatgcggtg atctcagggc aagggttgcc 120 
acgaccaccc agaacctcac cagcc atg aaa gcc cac ccc aag gag atg gtg 172 

Met Lys Ala His Pro Lys Glu Met Val 
1 5 

cct etc atg ggc aag aga gtt get gcc ccc agt ggg aac cct gcc gtc 220 
Pro Leu Met Gly Lys Arg Val Ala Ala Pro Ser Gly Asn Pro Ala Val 
10 15 20 25 

ctg cca gag aag agg ccg gcg gag ate acc ccc aca aag aag agt gca 2 68 
Leu Pro Glu Lys Arg Pro Ala Glu lie Thr Pro Thr Lys Lys Ser Ala 
30 35 40 

cac ttc ttc ctg gag ata gaa ggg ttt gaa ccc aac ccc aca gtt gcc 316 
His Phe Phe Leu Glu lie Glu Gly Phe Glu Pro Asn Pro Thr Val Ala 
45 50 55 

aag acc tct cct cct gtc ttc tec aag ccc atg gat tec aac ate egg 364 
Lys Thr Ser Pro Pro Val Phe Ser Lys Pro Met Asp Ser Asn lie Arg 
60 * 65 70 

cag tgc ate tct ggt aac tgt gat gac atg gac tec ccc cag tct cct 412 
Gin Cys lie Ser Gly Asn Cys Asp Asp Met Asp Ser Pro Gin Ser Pro 
75 80 85 

caa gat gat gtg aca gag acc cca tec aat ccc aac age ccc agt gca 4 60 
Gin Asp Asp Val Thr Glu Thr Pro Ser Asn Pro Asn Ser Pro Ser Ala 
90 95 100 105 

cag ctg gcc aag gaa gag cag agg agg aaa aag agg egg ctg aag aag 508 
Gin Leu Ala Lys Glu Glu Gin Arg Arg Lys Lys Arg Arg Leu Lys Lys 
110 115 ~ " 120 



cgc ate ttt gca gcc gtg tct gag ggc tgc gtg gag gag ttg gta gag 



556 
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PCT/US01/20640 



Arg lie Phe Ala Ala Val Ser Glu Gly Cys Val Glu Glu Leu Val Glu 
125 130 135 

ttg ctg gtg gag ctg cag gag ctt tgc agg egg cgc cat gat gag gat 604 
Leu Leu Val Glu Leu Gin Glu Leu Cys Arg Arg Arg His Asp Glu Asp 
140 145 150 

gtg cct gac ttc etc atg cac aag ctg acg gee tec gac acg ggg aag 652 
Val Pro Asp Phe Leu Met His Lys Leu Thr Ala Ser Asp Thr Gly Lys 
155 160 165 

ace tgc ctg atg aag gec ttg tta aac ate aac ccc aac acc aag gag 700 
Thr Cys Leu Met Lys Ala Leu Leu Asn lie Asn Pro Asn Thr Lys Glu 
170 175 180 185 

ate gtg egg ate ctg ctt gec ttt get gaa gag aac gac ate ctg ggc 748 
lie Val Arg lie Leu Leu Ala Phe Ala Glu Glu Asn Asp lie Leu Gly 
190 195 200 

agg ttc ate aac gee gag tac aca gag gag gec tat gaa ggg cag acg 796 
Arg Phe lie Asn Ala Glu Tyr Thr Glu Glu Ala Tyr Glu Gly Gin Thr 
205 210 215 

gcg ctg aac ate gec ate gag egg egg cag ggg gac ate gca gec ctg 84 4 
Ala Leu Asn lie Ala lie Glu Arg Arg Gin Gly Asp lie Ala Ala Leu 
220 225 230 

etc ate gec gec ggc gec gac gtc aac gcg cac gee aag ggg gec ttc 8 92 
Leu lie Ala Ala Gly Ala Asp Val Asn Ala His Ala Lys Gly Ala Phe 
235 240 245 

ttc aac ccc aag tac caa cac gaa ggc ttc tac ttc ggt gag acg ccc 940 
Phe Asn Pro Lys Tyr Gin His Glu Gly Phe Tyr Phe Gly Glu Thr Pro 
250 ~ 255 260 265 

ctg gee ctg gca gca tgc acc aac cag ccc gag att gtg cag ctg ctg 988 
Leu Ala Leu Ala Ala Cys Thr Asn Gin Pro Glu lie Val Gin Leu Leu 
270 275 280 

atg gag cac gag cag acg gac ate acc teg egg gac tea cga ggc aac 
1036 

Met Glu His Glu Gin Thr Asp lie Thr Ser Arg Asp Ser Arg Gly Asn 
285 290 295 

aac ate ctt cac gee ctg gtg acc gtg gee gag gac ttc aag acg cag 
1084 

Asn lie Leu His Ala Leu Val Thr Val Ala Glu Asp Phe Lys Thr Gin 

300 305 310 

aat gac ttt gtg aag cgc atg tac gac atg ate eta ctg egg agt ggc 
1132 

Asn Asp Phe Val Lys Arg Met Tyr Asp Met lie Leu Leu Arg Ser Gly 
315 320 325 

aac tgg gag ctg gag acc act cgc aac aac gat ggc etc acg ccg ctg 
1180 

Asn Trp Glu Leu Glu Thr Thr Arg Asn Asn Asp Gly Leu Thr Pro Leu 

330 335 340 345 

cag ctg gee gee aag atg ggc aag gcg gag ate ctg aag tac ate etc 
1228 
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Gin Leu Ala Ala Lys Met Gly Lys Ala Glu He Leu Lys Tyr He Leu 
350 355 360 

agt cgt gag ate aag gag aag egg etc egg age ctg tec agg aag ttc 
1276 

Ser Arg Glu He Lys Glu Lys Arg Leu Arg Ser Leu Ser Arg Lys Phe 
365 370 375 

acc gac tgg gcg tac gga ccc gtg tea tec tec etc tac gac etc acc 
1324 

Thr Asp Trp Ala Tyr Gly Pro Val Ser Ser Ser Leu Tyr Asp Leu Thr 

380 385 390 

aac gtg gac acc acc acg gac aac tea gtg ctg gaa ate act gtc tac 
1372 

Asn Val Asp Thr Thr Thr Asp Asn Ser Val Leu. Glu He Thr Val Tyr 
395 400 405 

aac acc aac ate gac aac egg cat gag atg ctg acc ctg gag ccg ctg 
1420 

Asn Thr Asn He Asp Asn Arg His Glu Met Leu Thr Leu Glu Pro Leu 

410 415 ~ 420 425 

cac acg ctg ctg cat atg aag tgg aag aag ttt gec aag cac atg ttc 
14 68 

His Thr Leu Leu His Met Lys Trp Lys Lys Phe Ala Lys His Met Phe 
430 435 440 

ttt ctg tec ttc tgc ttt tat ttc ttc tac aac ate acc ctg acc etc 
1516 

Phe Leu Ser Phe Cys Phe Tyr Phe Phe Tyr Asn He Thr Leu Thr Leu 
445 450 455 

gtc teg tac tac cgc ccc egg gag gag gag gee ate ccg cac ccc ttg 
1564 

Val Ser Tyr Tyr Arg Pro Arg Glu Glu Glu Ala He Pro His Pro Leu 
460 465 470 

gec ctg acg cac aag atg ggg tgg ctg cag etc eta ggg agg atg ttt 
1612 

Ala Leu Thr His Lys Met Gly Trp Leu Gin Leu Leu Gly Arg Met Phe 
475 480 485 

gtg etc ate tgg gec atg tgc ate tct gtg aaa gag ggc att gec ate 
1660 

Val Leu He Trp Ala Met Cys He Ser Val Lys Glu Gly lie Ala He 

490 495 500 505 

ttc ctg ctg aga ccc teg gat ctg cag tec ate etc teg gat gec tgg 
1708 

Phe Leu Leu Arg Pro Ser Asp Leu Gin Ser He Leu Ser Asp Ala Trp 
510 515 ~ 520 

ttc cac ttt gtc ttt ttt ate caa get gtg ctt gtg ata ctg tct gtc 
1756 

Phe His Phe Val Phe Phe lie Gin Ala Val Leu Val He Leu Ser Val 

525 530 535 

ttc ttg tac ttg ttt gec tac aaa gag tac etc gec tgc etc gtg ctg 
1804 
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Phe Leu Tyr Leu Phe Ala Tyr Lys Glu Tyr Leu Ala Cys Leu Val Leu 
540 545 550 

gcc atg gcc ctg ggc tgg gcg aac atg etc tac tat acg egg ggt ttc 
1852 

Ala Met Ala Leu Gly Trp Ala Asn Met Leu Tyr Tyr Thr Arg Gly Phe 
555 560 565 

cag tec atg ggc atg tac age gtc atg ate cag aag gtc att ttg cat 
1900 

Gin Ser Met Gly Met Tyr Ser Val Met He Gin Lys Val He Leu His 

570 575 580 585 

gat gtt ctg aag ttc ttg ttt gta tat ate gtg ttt ttg ctt gga ttt 
1948 

Asp Val Leu Lys Phe Leu Phe Val Tyr He Val Phe Leu Leu Gly Phe 
590 595 i 600 

gga gta gcc ttg gcc teg ctg ate gag aag tgt ccc aaa gac aac aag 
1996 

Gly Val Ala Leu Ala Ser Leu He Glu Lys Cys Pro Lys Asp Asn Lys 
605 610 615 

gac tgc age tec tac ggc age ttc agt gac gca gtg ctg gaa etc ttc 
2044 

Asp Cys Ser Ser Tyr Gly Ser Phe Ser Asp Ala Val Leu Glu Leu Phe 

620 625 630 

aag etc ace ata ggc ctg ggt gat ctg aac ate cag cag aac tec aag 
2092 

Lys Leu Thr He Gly Leu Gly Asp Leu Asn He Gin Gin Asn Ser Lys 
635 640 645 

tat ccc att etc ttt ctg ttc ctg etc ate acc tat gtc ate etc ace 
2140 

Tyr Pro He Leu Phe Leu Phe Leu Leu He Thr Tyr Val He Leu Thr' 

650 655 660 665 

ttt gtt etc etc etc aac atg etc att get ctg atg ggc gag act gtg 
2188 

Phe Val Leu Leu Leu Asn Met Leu He Ala Leu Met Gly Glu Thr Val 

670 675 680 

gag aac gtc tec aag gag age gaa cgc ate tgg cgc ctg cag aga gcc 
2236 

Glu Asn Val Ser Lys Glu Ser Glu Arg He Trp Arg Leu Gin Arg Ala 

685 690 695 

agg acc ate ttg gag ttt gag aaa atg tta cca gaa tgg ctg agg age 
2284 

Arg Thr He Leu Glu Phe Glu Lys Met Leu Pro Glu Trp Leu Arg Ser 
700 705 710 

aga ttc egg atg gga gag ctg tgc aaa gtg gcc gag gat gat ttc cga 
2332 

Arg Phe Arg Met Gly Glu Leu Cys Lys Val Ala Glu Asp Asp Phe Arg 

715 720 725 

ctg tgt ttg egg ate aat gag gtg aag tgg act gaa tggaagaege 
23?8 
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Leu Cys Leu Arg lie Asn Glu Val : 
730 735 

acgtctcctt ccttaacgaa gacccggggc 
2438 

aagattcttc caggaacaac agcaaaacca 
24 98 

tcccggaaac ctcggtgtag aagcggaacc 
2558 

gctgcaggcg gagtcaccga ctctgtgcag 
2618 

ctgctggcct agaagcagag tgcaccctcg 
2678 

ggggcagttg tcaatttgtc tgagtgggaa 
2738 

ctggtgtaaa cctacagcca gcagcagtct 
2798 

gaagcctctg gttttaccaa ttgcaggtcg 
2858 

ggggcagcag tcttgaggag cagggagagg 
2918 

gccccaaccc cagcccacac attgtaccat 
2978 

tgtgggagac ccgtgatccc atcccagagt 
3038 

tgggggagga ggggaatgcg ctctggaaac 
30 98 

accatccttg gagtgaacgt cccctgacac 
3158 

atgtttttag tagcataatt aacacagggt 
3218' 

gttgagaatg aaaattacta caatgaagtt 
3278 

gctttggggc tggatgtaag tattatatat 
3338 

aagcttttct tcacacacac aaaagtctgc 
3398 

cgccgcgatc tgggtgaaag gtcctggctc 
34 58 

tgcctgtgtg ctgggatcgt ggctttcgct 
3518 

tcatcttgga actcagcagt tagtcgcata 
3578 

aaaatgcctc aactgctttg gtacgaagta 
3638 

aagctgaagg ggaagcttgt gagagaaaag 
3698 

gtcctcgtcc tctgatgttc tctttctgaa 
3758 

tgaagtggag acggaaggat tttccaggtg 
3818 

ctgccgccgt tatctggtga aaggattcat 
3878 

tagtaaccat aaatgttaat cttaaatggc 
3938 

ggcgatagat aagttcatct ggcttgaggc 
3998 ' ' 

tctgggccag tttctctgcc ctgggccact 
4 058 

actttgtgcc cctgatgggg ccgattgggg 
4118 



jVS Tro Thr 
74 0 


Glu 




ctgtaagacg 


aacagatttc 


aacaaaatcc 


ctctcaatgc 


atttgaagaa 


gtcgaggaat 


cagagctggt 


gtgcgcgtgc 


gctgtctggc 


agaggctttg 


agggatgatg 


gagtccggct 


tgctcagtgc 


tcagtgggtg 


tctgaactga 


acatcctgga 


ttttgttact 


tggcaaacag 


ggagcctggg 


agcctcctga 


agtcccgggt 


gcttggctgg 


gagagatgga 


tggcgggaaa 


agtctttcct 


cctgccagct 


tcccccgtca 


ctcttctgct 


gtgactgggt 


tgcctgaatt 


gtgcggggga 


cggaggtaag 


ctggatatcc 


acccttccgg 


aacccttcgg 


ggaaaaggag 


cccaaggttc 


aaactgtctc 


aagctgagag 


tttaacttgc 


aatacggaaa 


agacatttca 


tgtgatttta 


aaagtggaga 


cagactgggg 


ttggcctcag 


ggtgcccaga 


gcaagacaaa 


atgagacact 


ccgggcaagt 


cctgctgggc 


ttttcctcgt 


cctgacctca 


cagtagcgca 


gaagcagaaa 


tagcagctgc 


tcgatcgata 


cctcagtacg 


tctcagtggg 


ggaatttaac 


tttttttttt 


aattttaact 


gtgaattttg 


catttgccaa 


gactttgagc 


ttatttttag 


atgacacgga 


gtcagtctcg 


ggggcagagg 


actggggccg 


aaaccaccag 


aaaatccact 


gtaaaaatgt 


tcgaggtgga 


attataaaaa 


agaaatagaa 


atttggcctt 


cagataacat 


aaactgaaga 


gtcggggcct 


agcattgcac 


ctgtgtgcca 


gactagctgg 


acagatagag 


agaggtgggc 


tggggtgtgc 


aggcttcaca 
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atccacagca gcecctgccc tcccagctga cccagggagt aatcgcgtgc tctaagccac 
4178 

agtggtcggg gctgggcatg ggcctctgga gaagagaaga tttgaggaga actgtcctag 
4238 

aggcaggagg agcagatgtg tttcagaatg ggcagaatta ggaaattgag aaagattttg 
4 298 

gctcaacaga atccagcaac tgctccagat gttggagatg tttaagcaga agctggttga 
4 358 

gcacttaatg aggaatgttg ttgaaaatgg tcattggaag aagtttaagg tcccttttag 
4418 

cctggagatt gtacaaatca gcattccaca tctggagtta gctacccgca ttaagcctga 
4 478 

acagacatct tggtctgaaa ggaagtggtt tggattcatg atgccaagct ccacactatg 
4538 

gagctgggaa ttccagaatt gctttgactc agatattaat ggagaaag 
4586 

<210> 2 
<211> 741 
<212> PRT 

<213> Homo sapiens 



<400> 2 



Met 


Lys 


Ala 


His 


Pro 


Lys 


Glu 


Met 


Val 


Pro 


Leu 


Met 


Gly 


Lys 


Arg 


Val 


1 








5 










10 










15 




Ala 


Ala 


Pro 


Ser 


Gly 


Asn 


Pro 


Ala 


Val 


Leu 


Pro 


Glu 


Lys 


Arg 


Pro 


Ala 








20 










25 










30 






Glu 


He 


Thr 


Pro 


Thr 


Lys 


Lys 


Ser 


Ala 


His 


Phe 


Phe 


Leu 


Glu 


He 


Glu 






35 










40 










45 








Gly 


Phe 


Glu 


Pro 


Asn 


Pro 


Thr 


Val 


Ala 


Lys 


Thr 


Ser 


Pro 


Pro 


Val 


Phe 


50 










55 










60 










Ser 


Lys 


Pro 


Met 


Asp 


Ser 


Asn 


He 


Arg 


Gin 


Cys 


He 


Ser 


Gly 


Asn 


Cys 


65 










70 










75 










80 


Asp 


Asp 


Met 


Asp 


Ser 


Pro 


Gin 


Ser 


Pro 


Gin 


Asp 


Asp 


Val 


Thr 


Glu 


Thr 










85 










90 










95 




Pro 


Ser 


Asn 


Pro 


Asn 


Ser 


Pro 


Ser 


Ala 


Gin 


Leu 


Ala 


Lys 


Glu 


Glu 


Gin 








100 










105 










110 






Arg 


Arg 


Lys 


Lys 


Arg 


Arg 


Leu 


Lys 


Lys 


Arg 


He 


Phe 


Ala 


Ala 


Val 


Ser 






115 










120 










125 








Glu 


Gly 


Cys 


Val 


Glu 


Glu 


Leu 


Val 


Glu 


Leu 


Leu 


Val 


Glu 


Leu 


Gin 


Glu 




130 










135 










140 










Leu 


Cys 


Arg 


Arg 


Arg 


His 


Asp 


Glu 


Asp 


Val 


Pro 


Asp 


Phe 


Leu 


Met 


His 


145 










150 










155 










160 


Lys 


Leu 


Thr 


Ala 


Ser 


Asp 


Thr 


Gly Lys 


Thr 


Cys 


Leu 


Met 


Lys 


Ala 


Leu 










165 










170 










175 




Leu 


Asn 


He 


Asn 


Pro 


Asn 


Thr 


Lys 


Glu 


He 


Val 


Arg 


He 


Leu 


Leu 


Ala 








180 










185 










190 






Phe 


Ala 


Glu 


Glu 


Asn 


Asp 


He 


Leu 


Gly 


Arg 


Phe 


He 


Asn 


Ala 


Glu 


Tyr 






195 










200 










205 








Thr 


Glu 


Glu 


Ala 


Tyr 


Glu 


Gly 


Gin 


Thr 


Ala 


Leu 


Asn 


He 


Ala 


He 


Glu 




210 










215 










220 










Arg 


Arg 


Gin 


Gly 


Asp 


He 


Ala 


Ala 


Leu 


Leu 


He 


Ala 


Ala 


Gly 


Ala 


Asp 


225 










230 










235 










240 


Val 


Asn 


Ala 


His 


Ala 


Lys 


Gly Ala 


Phe 


Phe 


Asn 


Pro 


l y s 


Tyr 


Gin 


His 










245 










250 










255 




Glu 


Gly 


Phe 


Tyr 


Phe 


Gly 


Glu 


Thr 


Pro 


Leu 


Ala 


Leu 


Ala 


Ala 


Cys 


Thr 








260 










265 










270 






Asn 


Gin 


Pro 


Glu 


He 


Val 


Gin 


Leu 


Leu 


Met 


Glu 


His 


Glu 


Gin 


Thr 


Asp 






275 










280 










285 








He 


Thr 


Ser 


Arg 


Asp 


Ser 


Arg 


Gly Asn 


Asn 


He 


Leu 


His 


Ala 


Leu 


Val 




290 










295 










300 
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Thr 


Val 


Ala 


Glu 


Asp 


Phe 


Lys 


Thr 


305 










310 






Tyr 


Asp 


Met 


He 


Leu 


Leu 


Arg 


Ser 










325 








Arg 


Asn 


Asn 


Asp 


Gly 


Leu 


Thr 


Pro 








340 










Lys 


Ala 


Glu 


He 


Leu 


Lys 


Tyr 


He 






355 










360 


Arg 


Leu 


Arg 


Ser 


Leu 


Ser 


Arg 


Lys 




370 










375 




Val 


Ser 


Ser 


Ser 


Leu 


Tyr 


Asp 


Leu 


385 










390 






Asn 


Ser 


Val 


Leu 


Glu 


lie 


Thr 


Val 










405 








His 


Glu 


Met 


Leu 


Thr 


Leu 


Glu 


Pro 








420 










Trp 


Lys 


Lys 


Phe 


Ala 


Lys 


His 


Met 






435 










440 


Phe 


Phe 


Tyr 


Asn 


He 


Thr 


Leu 


Thr 




450 










455 




Glu 


Glu 


Glu 


Ala 


He 


Pro 


His 


Pro 


465 










470 






Trp 


Leu 


Gin 


Leu 


Leu 


Gly 


Arg 


Met 










485 








He 


Ser 


Val 


Lys 


Glu 


Gly 


He 


Ala 








500 










Leu 


Gin 


Ser 


He 


Leu 


Ser 


Asp 


Ala 






515 










520 


Gin 


Ala 


Val 


Leu 


Val 


He 


Leu 


Ser 




530 










535 




Lys 


Glu 


Tyr 


Leu 


Ala 


Cys 


Leu 


Val 


545 










550 






Asn 


Met 


Leu 


Tyr 


Tyr 


Thr 


Arg 


Gly 










565 








Val 


Met 


He 


Gin 


Lys 


Val 


He 


Leu 








580 










Val 


Tyr 


He 


Val 


Phe 


Leu 


Leu 


Gly 






595 










600 


He 


Glu 


Lys 


Cys 


Pro 


Lys 


Asp 


Asn 




610 










615 




Phe 


Ser 


Asp 


Ala 


Val 


Leu 


Glu 


Leu 


625 










630 






Asp 


Leu 


Asn 


He 


Gin 


Gin 


Asn 


Ser 










645 








Leu 


Leu 


He 


Thr 


Tyr 


Val 


He 


Leu 








660 










Leu 


He 


Ala 


Leu 


Met 


Gly 


Glu 


Thr 






675 










680 


Glu 


Arg 


He 


Trp 


Arg 


Leu 


Gin 


Arg 




690 










695 




Lys 


Met 


Leu 


Pro 


Glu 


Trp 


Leu 


Arg 


705 










710 






Cys 


Lys 


Val 


Ala 


Glu 


Asp 


Asp 


Phe 










725 








Val 


Lys 


Trp 


Thr 


Glu 









740 
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Gin 


Asn 


Asp 
315 


Phe 


Val 


Lys 


Arg 


Met 
320 


Gly 


Asn 
330 


Trp 


Glu 


Leu 


Glu 


Thr 
335 


Thr 


Leu 


Gin 


Leu 


Ala 


Ala 


Lys 


Met 


Gly 


345 










350 






Leu 


Ser 


Arg 


Glu 


He 
365 


Lys 


Glu 


Lys 


Phe 


Thr 


Asp 


Trp 
380 


Ala 


Tyr 


Gly 


Pro 


Thr 


Asn 


Val 
395 


Asp 


Thr 


Thr 


Thr 


Asp 
400 


Tyr 


Asn 
410 


Thr 


Asn 


He 


Asp 


Asn 
415 


Arg 


Leu 


His 


Thr 


Leu 


Leu 


His 


Met 


Lys 


425 










430 






Phe 


Phe 


Leu 


Ser 


Phe 
445 


Cys 


Phe 


Tyr 


Leu 


Val 


Ser 


Tyr 
460 


Tyr 


Arg 


Pro 


Arg 


Leu 


Ala 


Leu 
475 


Thr 


His 


Lys 


Met 


Gly 
480 


Phe 


Val 
490 


Leu 


He 


Trp 


Ala 


Met 
4 95 


Cys 


He 


Phe 


Leu 


Leu 


Arg 


Pro 


Ser 


Asp 


505 










510 






Trp 


Phe 


His 


Phe 


Val 
525 


Phe 


Phe 


He 


Val 


Phe 


Leu 


Tyr 
540 


Leu 


Phe 


Ala 


Tyr 


Leu 


Ala 


Met 
555 


Ala 


Leu 


Gly 


Trp 


Ala 
560 


Phe 


Gin 
570 


Ser 


Met 


Gly 


Met 


Tyr 
575 


Ser 


His 


Asp 


Val 


Leu 


Lys 


Phe 


Leu 


Phe 


585 










590 






Phe 


Gly 


Val 


Ala 


Leu 
605 


Ala 


Ser 


Leu 


Lys 


Asp 


Cys 


Ser 
620 


Ser 


Tyr 


Gly 


Ser 


Phe 


Lys 


Leu 
635 


Thr 


He 


Gly 


Leu 


Gly 
640 


Lys 


Tyr 
650 


Pro 


He 


Leu 


Phe 


Leu 
655 


Phe 


Thr 


Phe 


Val 


Leu 


Leu 


Leu 


Asn 


Met 


665 










670 






Val 


Glu 


Asn 


Val 


Ser 
685 


Lys 


Glu 


Ser 


Ala 


Arg 


Thr 


He 
700 


Leu 


Glu 


Phe 


Glu 


Ser 


Arg 


Phe 
715 


Arg 


Met 


Gly 


Glu 


Leu 
720 


Arg 


Leu 
730 


Cys 


Leu 


Arg 


He 


Asn 
735 


Glu 
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<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . . (2223) 



<400> 3 

atg aaa gcc cac ccc aag gag atg gtg cct etc atg ggc aag aga gtt 

Met Lys Ala His Pro Lys Glu Met Val Pro Leu Met Gly Lys Arg Val 
15 10 15 



48 



get gcc ccc agt ggg aac cct gcc gtc ctg cca gag aag agg ccg'gcg 
Ala Ala Pro Ser Gly Asn Pro Ala Val Leu Pro Glu Lys Arg Pro Ala 
20 25 30 



96 



gag ate acc ccc aca aag aag agt gca cac ttc ttc ctg gag ata gaa 
Glu lie Thr Pro Thr Lys Lys Ser Ala His Phe Phe Leu Glu lie Glu 
35 40 45 



144 



ggg ttt gaa ccc aac ccc aca gtt gcc aag acc tct cct cct gtc ttc 
Gly Phe Glu Pro Asn Pro Thr Val Ala Lys Thr Ser Pro Pro Val Phe 
50 55 60 



192 



tec aag ccc atg gat tec aac ate egg cag tgc ate tct ggt aac tgt 
Ser Lys Pro Met Asp Ser Asn lie Arg Gin Cys lie Ser Gly Asn Cys 
65 70 75 80 



240 



gat gac atg gac tec ccc cag tct cct caa gat gat gtg aca gag acc 
Asp Asp Met Asp Ser Pro Gin Ser Pro Gin Asp Asp Val Thr Glu Thr 

85 90 95 



288 



cca tec aat ccc aac age ccc agt gca cag ctg gcc aag gaa gag cag 
Pro Ser Asn Pro Asn Ser Pro Ser Ala Gin Leu Ala Lys Glu Glu Gin 
100 105 110 



336 



agg agg aaa aag agg egg ctg aag aag cgc ate ttt gca gcc gtg tct 
Arg Arg Lys Lys Arg Arg Leu Lys Lys Arg He Phe Ala Ala Val Ser 
115 120 125 



384 



g a 9 99 c tgc gtg gag gag ttg gta gag ttg ctg gtg gag ctg cag gag 
Glu Gly Cys Val Glu Glu Leu Val Glu Leu Leu Val Glu Leu Gin Glu 
130 135 140 



432 



ctt tgc agg egg cgc cat gat gag gat gtg cct gac ttc etc atg cac 
Leu Cys Arg Arg Arg His Asp Glu Asp Val Pro Asp Phe Leu Met His 
145 ~ 150 155 160 



480 



aag ctg acg gcc tec gac acg ggg aag acc tgc ctg atg aag gcc ttg 
Lys Leu Thr Ala Ser Asp Thr Gly Lys Thr Cys Leu Met Lys Ala Leu 
165 170 175 



528 



tta aac ate aac ccc aac acc aag gag ate gtg egg ate ctg ctt gcc 
Leu Asn He Asn Pro Asn Thr Lys Glu He Val Arg He Leu Leu Ala 
180 18'5 190 



576 



ttt get gaa gag aac gac ate ctg ggc agg ttc ate aac gcc gag tac 
Phe Ala Glu Glu Asn Asp He Leu Gly Arg Phe He Asn Ala Glu Tyr 
195 200 205 



624 
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aca gag gag gcc tat gaa ggg cag acg gcg ctg aac ate gec ate gag 672 
Thr Glu Glu Ala Tyr Glu Gly Gin Thr Ala Leu Asn lie Ala lie Glu 
210 215 220 

egg egg cag ggg gac ate gca gcc ctg etc ate gcc gcc ggc gcc gac 720 
Arg Arg Gin Gly Asp lie Ala Ala Leu Leu lie Ala Ala Gly Ala Asp 
225 230 235 240 

gtc aac gcg cac gcc aag ggg gcc ttc ttc aac ccc aag tac caa cac 7 68 
Val Asn Ala His Ala Lys Gly Ala Phe Phe Asn Pro Lys Tyr Gin His 
245 250 255 

gaa ggc ttc tac ttc ggt gag acg ccc ctg gcc ctg gca gca tgc acc 816 
Glu Gly Phe Tyr Phe Gly Glu Thr Pro Leu Ala Leu Ala Ala Cys Thr 
260 265 270 

aac cag ccc gag att gtg cag ctg ctg atg gag cac gag cag acg gac 8 64 
Asn Gin Pro Glu lie Val Gin Leu Leu Met Glu His Glu Gin Thr Asp 
275 280 285 

ate acc teg egg gac tea cga ggc aac aac ate ctt cac gcc ctg gtg 912 
lie Thr Ser Arg Asp Ser Arg Gly Asn Asn lie Leu His Ala Leu Val 
290 295 300 

acc gtg gcc gag gac ttc aag acg cag aat gac ttt gtg aag cgc atg 960 
Thr Val Ala Glu Asp Phe Lys Thr Gin Asn Asp Phe Val Lys Arg Met 
305 310 315 320 

tac gac atg ate eta ctg egg agt ggc aac tgg gag ctg gag acc act 
1008 

Tyr Asp Met lie Leu Leu Arg Ser Gly Asn Trp Glu Leu Glu Thr Thr 
325 330 335 

cgc aac aac gat ggc etc acg ccg ctg cag ctg gcc gcc aag atg ggc 
1056 

Arg Asn Asn Asp Gly Leu Thr Pro Leu Gin Leu Ala Ala Lys Met Gly 
340 345 350 

aag gcg gag ate ctg aag tac ate etc agt cgt gag ate aag gag aag 
1104 

Lys Ala Glu lie Leu Lys Tyr lie Leu Ser Arg Glu lie Lys Glu Lys 

355 360 " 365 

egg etc egg age ctg tec agg aag ttc acc gac tgg gcg tac gga ccc 
1152 

Arg Leu Arg Ser Leu Ser Arg Lys Phe Thr Asp Trp Ala Tyr Gly Pro 
370 375 380 

gtg tea tec tec etc tac gac etc acc aac gtg gac acc acc acg gac 
1200 

Val Ser Ser Ser Leu Tyr Asp Leu Thr Asn Val Asp Thr Thr Thr Asp 

385 390 395 400 

aac tea gtg ctg gaa ate act gtc tac aac acc aac ate gac aac egg 
124 8 

Asn Ser Val Leu Glu lie Thr Val Tyr Asn Thr Asn lie Asp Asn Arg 

405 410 415 

cat gag atg ctg acc ctg gag ccg ctg cac acg ctg ctg cat atg aag 
1296 
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His Glu Met Leu Thr 
420 

tgg aag aag ttt gcc 
1344 

Trp Lys Lys Phe Ala 
435 

ttc ttc tac aac ate 
1392 

Phe Phe Tyr Asn lie 
450 

gag gag gag gcc ate 
14 40 

Glu Glu Glu Ala lie 
4 65 

tgg ctg cag etc eta 
1488 

Trp Leu Gin Leu Leu 
485 

ate tct gtg aaa gag 
1536 

lie Ser Val Lys Glu 
500 

ctg cag tec ate etc 
1584 

Leu Gin Ser lie Leu 
515 • 

caa get gtg ctt gtg 
1632 

Gin Ala Val Leu Val 
530 

aaa gag tac etc gcc 
1680 

Lys Glu Tyr Leu Ala 
545 

aac atg etc tac tat 
1728 

Asn Met Leu Tyr Tyr 
565 

gtc atg ate cag aag 
177 6 

Val Met lie Gin Lys 
580 

gta tat ate gtg ttt 
1824 

Val Tyr lie Val Phe 
595 

ate gag aag tgt ccc 
1872 



10 

Leu Glu Pro Leu His Thr 
425 

aag cac atg ttc ttt ctg 

Lys His Met Phe Phe Leu 
440 

ace ctg ace etc gtc teg 

Thr Leu Thr Leu Val Ser 
455 

ccg cac ccc ttg gcc ctg 

Pro His Pro Leu Ala Leu 
470 475 

ggg agg atg ttt gtg etc 

Gly Arg Met Phe Val Leu 
490 

ggc att gcc ate ttc ctg 

Gly lie Ala lie Phe Leu 
505 

teg gat gcc tgg ttc cac 

Ser Asp Ala Trp Phe His 
520 

ata ctg tct gtc ttc ttg 

lie Leu Ser Val Phe Leu 
535 

tgc etc gtg ctg gcc atg 

Cys Leu Val Leu Ala Met 
550 555 

acg egg ggt ttc cag tec 

Thr Arg Gly Phe Gin Ser 
570 

gtc att ttg cat gat gtt 

Val lie Leu His Asp Val 
585 

ttg ctt gga ttt gga gta 

Leu Leu Gly Phe Gly Val 
600 

aaa gac aac aag gac tgc 



PCT/USO 1/20640 

Leu Leu His Met Lys 
430 

tec ttc tgc ttt tat 

Ser Phe Cys Phe Tyr 
445 

tac tac cgc ccc egg 

Tyr Tyr Arg Pro Arg 
4 60 

acg cac aag atg ggg 

Thr His Lys Met Gly 
480 

ate tgg gcc atg tgc 

lie Trp Ala Met Cys 
495 

ctg aga ccc teg gat 

Leu Arg Pro Ser Asp 
510 

ttt gtc ttt ttt ate 

Phe Val Phe Phe lie 

525 

tac ttg ttt gcc tac 

Tyr Leu Phe Ala Tyr 
540 

gcc ctg ggc tgg gcg 

Ala Leu Gly Trp Ala 
560 

atg ggc atg tac age 

Met Gly Met Tyr Ser 
575 

ctg aag ttc ttg ttt 

Leu Lys Phe Leu Phe 
590 

gcc ttg gcc teg ctg 

Ala Leu Ala Ser Leu 
605 

age tec tac ggc age 
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lie Glu Lys Cys Pro Lys Asp Asn Lys Asp Cys Ser Ser Tyr Gly Ser 

610 615 620 

■ 

ttc agt gac gca gtg ctg gaa etc ttc aag etc ace ata ggc ctg ggt 
192 o 

Phe Ser Asp Ala Val Leu Glu Leu Phe Lys Leu Thr lie Gly Leu Gly 
625 630 635 640 

gat ctg aac ate cag cag aac tec aag tat ccc att etc ttt ctg ttc 
1968 

Asp Leu Asn lie Gin Gin Asn Ser Lys Tyr Pro lie Leu Phe Leu Phe 
645 650 655 

ctg etc ate acc tat gtc ate etc ace ttt gtt etc etc etc aac atg 
2016 

Leu Leu lie Thr Tyr Val lie Leu Thr Phe Val Leu Leu Leu Asn Met 
660 665 670 

etc att get ctg atg ggc gag act gtg gag aac gtc tec aag gag age 
2064 

Leu lie Ala Leu Met Gly Glu Thr Val Glu Asn Val Ser Lys Glu Ser 
675 680 685 

gaa cgc ate tgg cgc ctg cag aga gee agg acc ate ttg gag ttt gag 
2112 

Glu Arg lie Trp Arg Leu Gin Arg Ala Arg Thr lie Leu Glu Phe Glu • 
690 695 700 

aaa atg tta cca gaa tgg ctg agg age aga ttc egg atg gga gag ctg 
2160 

Lys Met Leu Pro Glu Trp Leu Arg Ser Arg Phe Arg Met Gly Glu Leu 

705 710 715 " 720 

tgc aaa gtg gee gag gat gat ttc cga ctg tgt ttg egg ate aat gag • 
2208 

Cys Lys Val Ala Glu Asp Asp Phe Arg Leu Cys Leu Arg lie Asn Glu 
725 730. 735 

gtg aag tgg act gaa 
2223 

Val Lys Trp Thr Glu 
740 



<210> 4 
<211> 3042 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (1) . . . (3039) 
<400> 4 

atg gtt gga gga tgc agg tgg aca gaa gac gtg gag cct gca gaa gta 4 8 
Met Val Gly Gly Cys Arg Trp Thr Glu Asp Val Glu Pro Ala Glu Val 
15 10 15 

aag gaa aag atg tec ttt egg gca gee agg etc age atg agg aac aga 96 
Lys Glu Lys Met Ser Phe Arg Ala Ala Arg Leu Ser Met Arg Asn Arg 
20 25 30 
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agg aat gac act ctg gac age acc egg acc ctg tac tec age gcg tct 14 4 

Arg Asn Asp Thr Leu Asp Ser Thr Arg Thr Leu Tyr Ser Ser Ala Ser 

35 40 45 

egg age aca gac ttg tct tac agt gaa age gac ttg gtg aat ttt att 192 

Arg Ser Thr Asp Leu Ser Tyr Ser Glu Ser Asp Leu Val Asn Phe lie 

50 55 60 

caa gca aat ttt aag aaa cga gaa tgt gtc ttc ttt acc aaa gat tec 240 

Gin Ala Asn Phe Lys Lys Arg Glu Cys Val Phe Phe Thr Lys Asp Ser 

65 70 75 80 

aag gee acg gag aat gtg tgc aag tgt ggc tat gee cag age cag cac 288 

Lys Ala Thr Glu Asn Val Cys Lys Cys Gly Tyr Ala Gin Ser Gin His 

85 90 95 

atg gaa ggc acc cag ate aac caa agt gag aaa tgg aac tac aag aaa 336 

Met Glu Gly Thr Gin lie Asn Gin Ser Glu Lys Trp Asn Tyr Lys Lys 

100 105 110 

cac acc aag gaa ttt cct acc gac gee ttt ggg gat att cag ttt gag 384 

His Thr Lys Glu Phe Pro Thr Asp Ala Phe Gly Asp lie Gin Phe Glu 

115 120 125 

aca ctg ggg aag aaa ggg aag tat ata cgt ctg tec tgc gac acg gac 432 

Thr Leu Gly Lys Lys Gly Lys Tyr lie Arg Leu Ser Cys Asp Thr Asp 

130 135 ' 140 

gcg gaa ate ctt tac gag ctg ctg acc cag cac tgg cac ctg aaa aca 4 80 

Ala Glu lie Leu Tyr Glu Leu Leu Thr Gin His Trp His Leu Lys Thr 

145 150 155 160 

ccc aac ctg gtc att tct gtg acc ggg ggc gee aag aac ttc gec ctg 528 

Pro Asn Leu Val lie Ser Val Thr Gly Gly Ala Lys Asn Phe Ala Leu 

165 ~ 170 175 

aag ccg cgc atg cgc aag ate ttc age egg etc ate tac ate gcg cag 576 

Lys Pro Arg Met Arg Lys lie Phe Ser Arg Leu lie Tyr lie Ala Gin 

180 ^ 185 190 

tec aaa ggt get tgg att etc acg gga ggc acc cat tat ggc ctg atg 624 

Ser Lys Gly Ala Trp lie Leu Thr Gly Gly Thr His Tyr Gly Leu Met 

195 200 "* 205 

aag tac etc ggg gag gtg gtg aga gat aac acc ate age agg agt tea 672 

Lys Tyr Leu Gly Glu Val Val Arg Asp Asn Thr lie Ser Arg Ser Ser 

210 215 220 

gag gag aat att gtg gee att ggc ata gca get tgg ggc atg gtc tec 720 

Glu Glu Asn lie Val Ala lie Gly lie Ala Ala Trp Gly Met Val Ser 

225 230 235 240 

aac egg gac acc etc ate agg aat tgc gat get gag ggc tat ttt tta 768 

Asn Arg Asp Thr Leu lie Arg Asn Cys Asp Ala Glu Gly Tyr Phe Leu 

245 " 250 255 

gee cag tac ctt atg gat gac ttc aca aga gat cca ctg tat ate ctg 816 

Ala Gin Tyr Leu Met Asp Asp Phe Thr Arg Asp Pro Leu Tyr lie Leu 

260 265 270 
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gac aac aac cac aca cat ttg ctg etc gtg gac aat ggc tgt cat gga 8 64 

Asp Asn Asn His Thr His Leu Leu Leu Val Asp Asn Gly Cys His Gly 
275 280 285 

cat ccc act gtc gaa gca aag etc egg aat cag eta gag aag tat ate 912 
His Pro Thr Val Glu Ala Lys Leu Arg Asn Gin Leu Glu Lys Tyr lie 
290 295 300 

tct gag cgc act att caa gat tec aac tat ggt ggc aag ate ccc att 960 
Ser Glu Arg Thr lie Gin Asp Ser Asn Tyr Gly Gly Lys lie Pro lie 
305 310 " 315 320 

gtg tgt ttt gec caa gga ggt gga aaa gag act ttg aaa gee ate aat 
1008 

Val Cys Phe Ala Gin Gly Gly Gly Lys Glu Thr Leu Lys Ala lie Asn 
325 330 335 

acc tec ate aaa aat aaa att cct tgt gtg gtg gtg gaa ggc teg ggc 
1056 

Thr Ser lie Lys Asn Lys lie Pro Cys Val Val Val Glu Gly Ser Gly 
340 345 350 

cag ate get gat gtg ate get age ctg gtg gag gtg gag gat gee ctg 
1104 

Gin lie Ala Asp Val lie Ala Ser Leu Val Glu Val Glu Asp Ala Leu 

355 360 365 

aca tct tct gee gtc aag gag aag ctg gtg cgc ttt tta ccc cgc acg 
1152 

Thr Ser Ser Ala Val Lys Glu Lys Leu Val Arg Phe Leu Pro Arg Thr 

370 375 380 

gtg tec egg ctg cct gag gag gag act gag agt tgg ate aaa tgg etc 
1200 

Val Ser Arg Leu Pro Glu Glu Glu Thr Glu Ser Trp lie Lys Trp Leu 

385 390 395 400 

aaa gaa att etc gaa tgt tct cac eta tta aca gtt att aaa atg gaa 
1248 

Lys Glu lie Leu Glu Cys Ser His Leu Leu Thr Val lie Lys Met Glu 
405 410 415 

gaa get ggg gat gaa att gtg age aat gee ate tec tac get eta tac 
1296 

Glu Ala Gly Asp Glu lie Val Ser Asn Ala lie Ser Tyr Ala Leu Tyr 
420 425 430 

aaa gec ttc age acc agt gag caa gac aag gat aac tgg aat ggg cag 
134 4 

Lys Ala Phe Ser Thr Ser Glu Gin Asp Lys Asp Asn Trp Asn Gly Gin 
435 440 445 

ctg aag ctt ctg ctg gag tgg aac cag ctg gac tta gee aat gat gag 
1392 

Leu Lys Leu Leu Leu Glu Trp Asn Gin Leu Asp Leu Ala Asn Asp Glu 
450 455 460 

att ttc acc aat gac cgc cga tgg gag aag age aaa ccg agg etc aga 

lie Phe Thr Asn Asp Arg Arg Trp Glu Lys Ser Lys Pro Arg Leu Arg 
465 470 475 480 



BNSDOCID: <WO 0200722A2J_> 



WO 02/00722 



14 



PCT/US01/20640 



gac aca ata ate 
1488 

Asp Thr lie lie 



gag age aaa gat 
1536 

Glu Ser Lys Asp 
500 

gtt etc aag tct 
1584 

Val Leu Lys Ser 
515 

aag gac aga ccc 
1632 

Lys Asp Arg Pro 
530 . 

eta egg aag ttt 
1680 

Leu Arg Lys Phe 
545 

cac ttc age acg 
1728 

His Phe Ser Thr 



tat aat gat gee 
177 6 

Tyr Asn Asp Ala 
580 

cga aga ggc ttc 
1824 

Arg Arg Gly Phe 
595 

ata gaa etc cac 
1872 

lie Glu Leu His 
610 

etc ttc ate tgg 
1920 

Leu Phe lie Trp 
625 

att tgg gag cag 
1968 

lie Trp Glu Gin 



aag ctt ctg aag 
2016 

Lys Leu Leu Lys 
660 



cag gtc aca tgg 

Gin Val Thr Trp 
485 

gtg act gac ggc 
Val Thr Asp Gly 

get gac ctt caa 

Ala Asp Leu Gin 
520 

aag ttt gtc cgc 

Lys Phe Val Arg 
535 

etc ace cat gat 

Leu Thr His Asp 
550 

ctt gtg tac egg 

Leu Val Tyr Arg 
565 

etc etc acg ttt 
Leu Leu Thr Phe 

egg aag gaa gac 

Arg Lys Glu Asp 
600 

gac gtg tct cct 

Asp Val Ser Pro 
615 

gee att ctt cag 

Ala lie Leu Gin 
630 

acc agg ggc tgc 

Thr Arg Gly Cys 
64 5 

act ctg gee aaa 
Thr Leu Ala Lys 



ctg gaa aat ggt 

Leu Glu Asn Gly 
490 

aaa gee tct tct 

Lys Ala Ser Ser 
505 

gaa gtc atg ttt 
Glu Val Met Phe 

etc ttt ctg gag 

Leu Phe Leu Glu 
540 

gtc etc act gaa 

Val Leu Thr Glu 
555 

aat ctg cag ate 

Asn Leu Gin lie 
570 

gtc tgg aaa ctg 

Val Trp Lys Leu 
585 

aga aat ggc egg 
Arg Asn Gly Arg 

att act egg cac 

lie Thr Arg His 
620 

aat aag aag gaa 

Asn Lys Lys Glu 
635 

act ctg gca gee 

Thr Leu Ala Ala 
650 

gtg aag aac gac 

Val Lys Asn Asp 
665 



aga ate aag gtt 

Arg lie Lys Val 
495 

cat atg ctg gtg 

His Met Leu Val 
510 

acg get etc ata 

Thr Ala Leu lie 
525 

aat ggc ttg aac 
Asn Gly Leu Asn 

etc ttc tec aac 

Leu Phe Ser Asn 
560 

gee aag aat tec 

Ala Lys Asn Ser 
575 

gtt gcg aac ttc 

Val Ala Asn Phe 
590 

gac gag atg gac 

Asp Glu Met Asp 
605 

ccc ctg caa get 
Pro Leu Gin Ala 

etc tec aaa gtc 

Leu Ser Lys Val 
64 0 

ctg gga gee age 

Leu Gly Ala Ser 
655 

ate aat get get 

lie Asn Ala Ala 
670 
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ggg gag tec gag 
2064 

Gly Glu Ser Glu 
675 

gag tec aca gtg 
2112 

Glu Ser Thr Val 
690 

aca gac att gec 
2160 

Thr Asp lie Ala 
705 

act gag tgt tac 
2208 

Thr Glu Cys Tyr 



tat tec tgt gaa 
2256 

Tyr Ser Cys Glu 
740 

gag gec aca gac 
2304 

Glu Ala Thr Asp 
755 

ctt tct aag caa 
2352 

Leu Ser Lys Gin 
770 

aag att ate ctg 
2400 

Lys lie lie Leu 
785 

gta tea ttt agg 
2448 

Val Ser Phe Arg 



tac tat gtg gcg 
24 96 

Tyr Tyr Val Ala 
820 

gtg gtc ttc tac 
2544 

Val Val Phe Tyr 
835 

atg gat ttc cat 
2592 

Met Asp Phe His 
850 



gag ctg get 
Glu Leu Ala 

tgg aat get 

Trp Asn Ala 
695 

age ggc act 

Ser Gly Thr 
710 

age age gat 

Ser Ser Asp 
725 

get tgg ggt 

Ala Trp Gly 

cag cat ttc 
Gin His Phe 

tgg tat gga 

Trp Tyr Gly 
775 

tgt ctg ttt 

Cys Leu Phe 
790 

aag aaa cct 

Lys Lys Pro 
805 

ttc ttc ace 
Phe Phe Thr 

ate gec ttc 
He Ala Phe 

teg gtg cca 

Ser Val Pro 
855 
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aat gag tac 

Asn Glu Tyr 
680 

gtg gtg ggc 
Val Val Gly 

cat aga cca 
His Arg Pro 

gaa gac ttg 

Glu Asp Leu 
730 

gga age aac 

Gly Ser Asn 
745 

ate gee cag 

He Ala Gin 
760 

gag att tec 
Glu He Ser 

att ata ccc 
He He Pro 

gtc gac aag 

Val Asp Lys 
810 

tec ccc ttc 

Ser Pro Phe 
825 

etc ctg ctg 

Leu Leu Leu 
840 

cac ccc ccc 
His Pro Pro 



ctg ace egg 

Leu Thr Arg 
685 

gcg gat ctg 

Ala Asp Leu 
700 

gat ggt gga 

Asp Gly Gly 
715 

gca gaa cag 
Ala Glu Gin 

tgt ctg gag 
Cys Leu Glu 

cct ggg gtc 

Pro Gly Val 
765 

cga gac ace 

Arg Asp Thr 
780 

ttg gtg ggc 

Leu Val Gly 
7 95 

cac aag aag 
His' Lys Lys 

gtg gtc ttc 
Val Val Phe 

ttt gec tac 

Phe Ala Tyr 
845 

gag ctg gtc 

Glu Leu Val 
860 
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get gtt ggt 
Ala Val Gly 

cca tgt ggc 
Pro Cys Gly 

gag ctg ttc 

Glu Leu Phe 
720 

ctg ctg gtc 

Leu Leu Val 
735 

ctg gcg gtg 

Leu Ala Val 
750 

cag aat ttt 
Gin Asn Phe 

aag aac tgg 
Lys Asn Trp 

tgt ggc ttt 

Cys Gly Phe 
800 

ctg ctt tgg 

Leu Leu Trp 
815 

tec tgg aat 

Ser Trp Asn 
830 

gtg ctg etc 
Val Leu Leu 

ctg tac teg 
Leu Tyr Ser 
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ctg gtc ttt gtc etc ttc tgt gat gaa gtg aga cag ggc egg ccg get 
2640 

Leu Val Phe Val Leu Phe Cys Asp Glu Val Arg Gin Gly Arg Pro Ala 
865 870 875 880 

get ccc agt gcg ggg ccc gee aag ccc acg ccc ace egg aac tec ate 
2688 

Ala Pro Ser Ala Gly Pro Ala Lys Pro Thr Pro Thr Arg Asn Ser lie 
885 890 895 

tgg ccc gca age tec aca cgc age ccc ggt tec cgc tea cgc cac tec 
2736 

Trp Pro Ala Ser Ser Thr Arg Ser Pro Gly Ser Arg Ser Arg His Ser 

900 " 905 ~ 910 

ttc cac act tec ctg caa get gag ggt gee age tct ggc ctt ggc cag 
2784 

Phe His Thr Ser Leu Gin Ala Glu Gly Ala Ser Ser Gly Leu Gly Gin 
915 920 925 

ccc aga aag ggg etc cca cag tgc age ggt ggg ctg aag ggc tec tea 
2832 

Pro Arg Lys Gly Leu Pro Gin Cys Ser Gly Gly Leu Lys Gly Ser Ser 
930 935 940 

agt gec gee aaa gtg gga gee cag gca gag gag gtg ccg aga gca age 
2880 

Ser Ala Ala Lys Val Gly Ala Gin Ala Glu Glu Val Pro Arg Ala Ser 

945 950 955 ~ 960 

gag ggc tgt gag gac tgc cag cac get gtc ace tct cag aag cgt aag 
2928 

Glu Gly Cys Glu Asp Cys Gin His Ala Val Thr Ser Gin Lys Arg Lys 
965 970 97 5 

aca gca atg gac caa aca gac gaa gat etc ttc ccc tat gga gca ttc 
2 97 6 

Thr Ala Met Asp Gin Thr Asp Glu Asp Leu Phe Pro Tyr Gly Ala Phe 

980 985 990 

tac cag ttc ctg atg att tec agg age ttt cga gga gag gag atg age 
3024 

Tyr Gin Phe Leu Met lie Ser Arg Ser Phe Arg Gly Glu Glu Met Ser 
995 1000 1005 

ate ggc aag cag cac taa 
3042 

lie Gly Lys Gin His 
1010 



<210> 5 
<211> 1013 
<212> PRT 

<213> Homo sapiens 
<400>'5' 

Met Val Gly Gly Cys Arg Trp Thr Glu Asp Val Glu Pro Ala Glu Val 

1 5 10 15 

Lys Glu Lys Met Ser Phe Arg Ala Ala Arg Leu Ser Met Arg Asn Arg 
20 25 30 
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Arg 


Asn 


Asp 


Thr 


Leu 


Asp 


Ser 


Thr 


Arg 


Thr 


Leu 


Tyr 


Ser 


Ser 


Ala 


Ser 






35 










40 










45 








Arg 


Ser 


Thr 


Asp 


Leu 


Ser 


Tyr 


Ser 


Glu 


Ser 


Asp 


Leu 


Val 


Asn 


Phe 


He 




50 










55 










60 










Gin 


Ala 


Asn 


Phe 


Lys 


Lys 


Arg 


Glu 


Cys 


Val 


Phe 


Phe 


Thr 


Lys 


Asp 


Ser 


65 










70 










75 










80 


Lys 


Ala 


Thr 


Glu 


Asn 


Val 


Cys 


Lys 


Cys 


Gly 


Tyr 


Ala 


Gin 


Ser 


Gin 


His 










85 










90 










95 




Met 


Glu 


Gly 


Thr 


Gin 


He 


Asn 


Gin 


Ser 


Glu 


Lys 


Trp 


Asn 


Tyr 


Lys 


Lys 








100 










105 










110 






His 


Thr 


Lys 


Glu 


Phe 


Pro 


Thr 


Asp 


Ala 


Phe 


Gly 


Asp 


He 


Gin 


Phe 


Glu 






115 










120 










125 








Thr 


Leu 


Gly 


Lys 


Lys 


Gly 


Lys 


Tyr 


He 


Arg 


Leu 


Ser 


Cys 


Asp 


Thr 


Asp 




130 










135 










140 










Ala 


Glu 


He 


Leu 


Tyr 


Glu 


Leu 


Leu 


Thr 


Gin 


His 


Trp 


His 


Leu 


Lys 


Thr 


145 










150 










155 










160 


Pro 


Asn 


Leu 


Val 


He 


Ser 


Val 


Thr 


Gly 


Gly 


Ala 


Lys 


Asn 


Phe 


Ala 


Leu 










165 










170 










175 




Lys 


Pro 


Arg 


Met 


Arg 


Lys 


He 


Phe 


Ser 


Arg 


Leu 


He 


Tyr 


He 


Ala 


Gin 








180 










185 










190 






Ser 


Lys 


Gly 


Ala 


Trp 


He 


Leu 


Thr 


Gly 


Gly 


Thr 


His 


Tyr 


Gly 


Leu 


Met 






195 










200 










205 








Lys 


Tyr 


Leu 


Gly 


Glu 


Val 


Val 


Arg 


Asp 


Asn 


Thr 


He 


Ser 


Arg 


Ser 


Ser 




210 










215 










220 










Glu 


Glu 


Asn 


He 


Val 


Ala 


He 


Gly 


He 


Ala 


Ala 


Trp 


Gly 


Met 


Val 


Ser 


225 










230 










235 










240 


Asn 


Arg 


Asp 


Thr 


Leu 


He 


Arg 


Asn 


Cys 


Asp 


Ala 


Glu 


Gly 


Tyr 


Phe 


Leu 










245 










250 










255 




Ala 


Gin 


Tyr 


Leu 


Met 


Asp 


Asp 


Phe 


Thr 


Arg 


Asp 


Pro 


Leu 


Tyr 


He 


Leu 








260 










265 










270 






Asp 


Asn 


Asn 


His 


Thr 


His 


Leu 


Leu 


Leu 


Val 


Asp 


Asn 


Gly 


Cys 


His 


Gly 






275 










280 










285 








His 


Pro 


Thr 


Val 


Glu 


Ala 


Lys 


Leu 


Arg 


Asn 


Gin 


Leu 


Glu 


Lys 


Tyr 


He 




290 










295 










300 










Ser 


Glu Arg 


Thr 


He 


Gin 


Asp 


Ser 


Asn 


Tyr 


Gly 


Gly 


Lys 


He 


Pro 


He 


305 










310 










315 










320 


Val 


Cys 


Phe 


Ala 


Gin 


Gly 


Gly 


Gly 


Lys 


Glu 


Thr 


Leu 


Lys 


Ala 


He 


Asn 










325 










330 










335 




Thr 


Ser 


He 


Lys 


Asn 


Lys 


He 


Pro 


Cys 


Val 


Val 


Val 


Glu 


Gly 


Ser 


Gly 








340 










345 










350 






Gin 


He 


Ala 


Asp 


Val 


He 


Ala 


Ser 


Leu 


Val 


Glu 


Val 


Glu 


Asp 


Ala 


Leu 






355 










360 










365 








Thr 


Ser 


Ser 


Ala 


Val 


Lys 


Glu 


Lys 


Leu 


Val 


Arg 


Phe 


Leu 


Pro 


Arg 


Thr 




370 










375 










380 










Val 


Ser 


Arg 


Leu 


Pro 


Glu 


Glu 


Glu 


Thr 


Glu 


Ser 


Trp 


He 


Lys 


Trp 


Leu 


385 










390 










395 










400 


Lys 


Glu 


He 


Leu 


Glu 


Cys 


Ser 


His 


Leu 


Leu 


Thr 


Val 


He 


Lys 


Met 


Glu 










405 










410 










415 




Glu 


Ala 


Gly 


Asp 


Glu 


He 


Val 


Ser 


Asn 


Ala 


He 


Ser 


Tyr 


Ala 


Leu 


Tyr 








420 










425 










430 






Lys 


Ala 


Phe 


Ser 


Thr 


Ser 


Glu 


Gin 


Asp 


Lys 


Asp 


Asn 


Trp 


Asn 


Gly 


Gin 






435 










440 










445 








Leu 


Lys 


Leu 


Leu 


Leu 


Glu 


Trp 


Asn 


Gin 


Leu 


Asp 


Leu 


Ala 


Asn 


Asp 


Glu 




450 










455 










460 










He 


Phe 


Thr 


Asn 


Asp 


Arg 


Arg 


Trp 


Glu 


Lys 


Ser 


Lys 


Pro 


Arg 


Leu 


Arg 


465 










470 










475 










480 


Asp 


Thr 


He 


He 


Gin 


Val 


Thr 


Trp 


Leu 


Glu 


Asn 


Gly 


Arg 


He 


Lys 


Val 










485 










490 










4 95 




Glu 


Ser 


Lys 


Asp 


Val 


Thr 


Asp 


Gly 


Lys 


Ala 


Ser 


Ser 


His 


Met 


Leu 


Val 



500 505 510 
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Val 


Leu 


Lys 


Ser 


Ala 


Asp 


Leu Gin 


Glu 






515 








520 




Lys 


Asp 


Arg 


Pro 


Lys 


Phe 


Val Arg 


Leu 




530 










535 




Leu 


Arg 


Lys 


Phe 


Leu 


Thr 


His Asp 


Val 


54 5 










550 






His 


Phe 


Ser 


Thr 


Leu 


Val 


Tyr Arg Asn 










565 








Tyr 


Asn 


Asp 


Ala 


Leu 


Leu 


Thr Phe 


Val 








580 








585 


Arg 


Arg 


Gly 


Phe 


Arg 


Lys 


Glu Asp Arg 






595 








600 




lie 


Glu 


Leu 


His 


Asp 


Val 


Ser Pro 


lie 




610 










615 




Leu 


Phe 


He 


Trp 


Ala 


He 


Leu Gin 


Asn 


625 










630 






lie 


Trp 


Glu 


Gin 


Thr 


Arg 


Gly Cys 


Thr 










645 








Lys 


Leu 


Leu 


Lys 


Thr 


Leu 


Ala Lys 


Val 








660 








665 


Gly 


Glu 


Ser 


Glu 


Glu 


Leu 


Ala Asn 


Glu 






675 








680 




Glu 


Ser 


Thr 


Val 


Trp 


Asn 


Ala Val 


Val 




690 










695 




Thr 


Asp 


He 


Ala 


Ser 


Gly 


Thr His 


Arg 


705 










710 






Thr 


Glu 


Cys 


Tyr 


Ser 


Ser 


Asp Glu Asp 










725 








Tyr 


Ser 


Cys 


Glu 


Ala 


Trp 


Gly Gly Ser 








740 








745 


Glu 


Ala 


Thr 


Asp 


Gin 


His 


Phe lie 


Ala 






755 








760 




Leu 


Ser 


Lys 


Gin 


Trp 


Tyr 


Gly Glu 


He 




770 










775 




Lys 


He 


He 


Leu 


Cys 


Leu 


Phe lie 


He 


785 










790 






Val 


Ser 


Phe 


Arg 


Lys 


Lys 


Pro Val 


Asp 










805 








Tyr 


Tyr Val 


Ala 


Phe 


Phe 


Thr Ser 


Pro 








820 








825 


Val 


Val 


Phe 


Tyr 


lie 


Ala 


Phe ' Leu 


Leu 






835 








840 




Met 


Asp 


Phe 


His 


Ser 


Val 


Pro His 


Pro 




850 










855 




Leu 


Val 


Phe 


Val 


Leu 


Phe 


Cys Asp 


Glu 


865 










870 






Ala 


Pro 


Ser 


Ala 


Gly 


Pro 


Ala Lys 


Pro 










885 








Trp 


Pro 


Ala 


Ser 


Ser 


Thr 


Arg Ser 


Pro 








900 








905 


Phe 


His 


Thr 


Ser 


Leu 


Gin Ala Glu Gly 






915 








920 




Pro 


Arg 


Lys 


Gly 


Leu 


Pro 


Gin Cys 


Ser 




930 










935 




Ser 


Ala 


Ala 


Lys 


Val 


Gly Ala Gin 


Ala 


945 










950 






Glu 


Gly Cys 


Glu 


Asp 


Cys 


Gin His 


Ala 










965 








Thr 


Ala 


Met 


Asp 


Gin 


Thr 


Asp Glu Asp 








980 








985 
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Val 


Met 


Phe 


Thr 


Ala 


Leu 


lie 








525 








Phe 


Leu 


Glu 


Asn 


Gly 


Leu 


Asn 






540 










Leu 


Thr 


Glu 


Leu 


Phe 


Ser 


Asn 




c c c 










560 


Leu 


Gin 


lie 


Ala 


Lys 


Asn 


Ser 


570 










575 




Trp 


Lys 


Leu 


Val 


Ala 


Asn 


Phe 










590 






Asn 


Gly 


Arg 


Asp 


Glu 


Met 


Asp 








605 








Thr 


Arg 


His 


Pro 


Leu 


Gin 


Ala 






620 










Lys 


Lys 


Glu 


Leu 


Ser 


Lys 


Val 




635 










640 


Leu 


Ala 


Ala 


Leu 


Gly 


Ala 


Ser 


650 










655 




Lys 


Asn 


Asp 


lie 


Asn 


Ala 


Ala 










670 






Tyr 


Leu 


Thr 


Arg 


Ala 


Val 


Gly 








685 








Gly 


Ala 


Asp 


Leu 


Pro 


Cys 


Gly 






700 










Pro 


Asp 


Gly 


Gly 


Glu 


Leu 


Phe 




715 










720 


Leu 


Ala 


Glu 


Gin 


Leu 


Leu 


Val 


730 










735 




Asn 


Cys 


Leu 


Glu 


Leu 


Ala 


Val 










750 






Gin 


Pro 


Gly 


Val 


Gin 


Asn 


Phe 








765 








Ser 


Arg 


Asp 


Thr 


Lys 


Asn 


Trp 






780 










Pro 


Leu 


Val 


Gly 


Cys 


Gly 


Phe 




795 










800 


Lys 


His 


Lys 


Lys 


Leu 


Leu 


Trp 


810 










815 




Phe 


Val 


Val 


Phe 


Ser 


Trp 


Asn 










830 






Leu 


Phe 


Ala 


Tyr 


Val 


Leu 


Leu 








845 








Pro 


Glu 


Leu 


Val 


Leu 


Tyr 


Ser 






860 










Val 


Arg 


Gin 


Gly 


Arg 


Pro 


Ala 




875 










880 


Thr 


Pro 


Thr 


Arg 


Asn 


Ser 


He 


o o r\ 

o 90 










o r» rr 

895 




Gly 


Ser 


Arg 


Ser 


Arg 


His 


Ser 










910 






Ala 


Ser 


Ser 


Gly 


Leu 


Gly 


Gin 








y a d 








Gly 


Gly 


Leu 


Lys 


Gly 


Ser 


Ser 






y 4 u 










Glu 


Glu 


Val 


Pro 


Arg 


Ala 


Ser 




955 










960 


Val 


Thr 


Ser 


Gin 


Lys 


Arg 


Lys 


970 










975 




Leu 


Phe 


Pro 


Tyr 


Gly 


Ala 


Phe 



990 
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Tyr Gin Phe Leu Met lie Ser Arg Ser Phe Arg Gly Glu Glu Met Ser 

995 1000 1005 

lie Gly Lys Gin His 
1010 

<210> 6 

<211> 3039 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . . (3039) 

<400> 6 

atg gtt gga gga tgc agg tgg aca gaa gac gtg gag cct gca gaa gta 4 8 

Met Val Gly Gly Cys Arg Trp Thr Glu Asp Val Glu Pro Ala Glu Val 

15 10 15 

aag gaa aag atg tec ttt egg gca gee agg etc age atg agg aac aga 96 
Lys Glu Lys Met Ser Phe Arg Ala Ala Arg Leu Ser Met Arg Asn Arg 
20 25 30 

agg aat gac act ctg gac age acc egg acc ctg tac tec age gcg tct 144 
Arg Asn Asp Thr Leu Asp Ser Thr Arg Thr Leu Tyr Ser Ser Ala Ser 
35 40 45 

egg age aca gac ttg tct tac agt gaa age gac ttg gtg aat ttt att 192 
Arg Ser Thr Asp Leu Ser Tyr Ser Glu Ser Asp Leu Val Asn Phe lie 
50 55 60 

caa gca aat ttt aag aaa cga gaa tgt gtc ttc ttt acc aaa gat tec 240 
Gin Ala Asn Phe Lys Lys Arg Glu Cys Val Phe Phe Thr Lys Asp Ser 
65 70 75 " 80 

aag gee acg gag aat gtg tgc aag tgt ggc tat gee cag age cag cac 288 
Lys Ala Thr Glu Asn Val Cys Lys Cys Gly Tyr Ala Gin Ser Gin His 
85 90 95 

atg gaa ggc acc cag ate aac caa agt gag aaa tgg aac tac aag aaa 336 
Met Glu Gly Thr Gin lie Asn Gin Ser Glu Lys Trp Asn Tyr Lys Lys 
100 105 110 

cac acc aag gaa ttt cct acc gac gee ttt ggg gat att cag ttt gag 384 
His Thr Lys Glu Phe Pro Thr Asp Ala Phe Gly Asp lie Gin Phe Glu 
115 120 125 

aca ctg ggg aag aaa ggg aag tat ata cgt ctg tec tgc gac acg gac 432 
Thr Leu Gly Lys Lys Gly Lys Tyr lie Arg Leu Ser Cys Asp Thr Asp 
130 135 140 

gcg gaa ate ctt tac gag ctg ctg acc cag cac tgg cac ctg aaa aca 4 80 
Ala Glu lie Leu Tyr Glu Leu Leu Thr Gin His Trp His Leu Lys Thr 
145 150 155 160 

ccc aac ctg gtc att tct gtg acc ggg ggc gee aag aac ttc gec ctg 528 
Pro Asn Leu Val lie Ser Val Thr Gly Gly Ala Lys Asn Phe Ala Leu 
165 170 175 
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aag ccg cgc atg cgc 
Lys Pro Arg Met Arg 
180 

tec aaa ggt get tgg 
Ser Lys Gly Ala Trp 
195 

aag tac etc ggg gag 
Lys Tyr Leu Gly Glu 
210 

gag gag aat att gtg 
Glu Glu Asn lie Val 
225 

aac egg gac ace etc 
Asn Arg Asp Thr Leu 
245 

gee cag tac ctt atg 
Ala Gin Tyr Leu Met 
260 

gac aac aac cac aca 
Asp Asn Asn His Thr 
275 

cat ccc act gtc gaa 
His Pro Thr Val Glu 
290 

tct gag cgc act att 
Ser Glu Arg Thr lie 
305 

gtg tgt ttt gee caa 
1008 

Val Cys Phe Ala Gin 

325 

ace tec ate aaa aat 
1056 ■ 

Thr Ser lie Lys Asn 
340 

cag ate get gat gtg 
1104 

Gin lie Ala Asp Val 
355 

aca tct tct gee gtc 
1152 

Thr Ser Ser Ala Val 
370 

gtg tec egg ctg cct 
1200 

Val Ser Arg Leu Pro 
385 



20 

aag ate ttc age egg etc 
Lys lie Phe Ser Arg Leu 
185 

att etc acg gga ggc ace 
lie Leu Thr Gly Gly Thr 
200 

gtg gtg aga gat aac ace 
Val Val Arg Asp Asn Thr 
215 

gee att ggc ata gca get 
Ala lie Gly lie Ala Ala 
230 235 

ate agg aat tgc gat get 
lie Arg Asn Cys Asp Ala 
250 

gat gac ttc aca aga gat 
Asp Asp Phe Thr Arg Asp 
265 

cat ttg ctg etc gtg gac 
His Leu Leu Leu Val Asp 
280 

gca aag etc egg aat cag 
Ala Lys Leu Arg Asn Gin 
295 

caa gat tec aac tat ggt 
Gin Asp Ser Asn Tyr Gly 
310 315 

gga ggt gga aaa gag act 

Gly Gly Gly Lys Glu Thr 
330 

aaa att cct tgt gtg gtg 

Lys lie Pro Cys Val Val 
34 5 

ate get age ctg gtg gag 

He Ala Ser Leu Val Glu 
360 

aag gag aag ctg gtg cgc 

Lys Glu Lys Leu Val Arg 
375 

gag gag gag act gag agt 

Glu Glu Glu Thr Glu Ser 
390 ' 395 
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ate tac ate gcg cag 576 
He Tyr He Ala Gin 
190 

cat tat ggc ctg atg 624 
His Tyr Gly Leu Met 
205 

ate age agg agt tea 672 

He Ser Arg Ser Ser 

220 

tgg ggc atg gtc tec 720 
Trp Gly Met Val Ser 
240 

gag ggc tat ttfc tta 768 
Glu Gly Tyr Phe Leu 
255 

cca ctg tat ate ctg 816 
Pro Leu Tyr He Leu 
270 

aat ggc tgt cat gga 864 
Asn Gly Cys His Gly 
285 

eta gag aag tat ate. 912 

Leu Glu Lys Tyr He 

300 

ggc aag ate ccc att • 960 
Gly Lys He Pro He 
320 

ttg aaa gee ate aat 

Leu Lys Ala He Asn 
335 

gtg gaa ggc teg ggc 

Val Glu Gly Ser Gly 
350 

gtg gag gat gee ctg 

Val Glu Asp Ala Leu 
365 

ttt tta ccc cgc acg 

Phe Leu Pro Arg Thr 
380 

tgg ate aaa tgg etc 

Trp He Lys Trp Leu 
400 
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aaa gaa att etc 
1248 

Lys Glu lie Leu 



gaa get ggg gat 
12 96 

Glu Ala Gly Asp 
420 

aaa gee ttc age 
134 4 

Lys Ala Phe Ser 
435 

ctg aag ctt ctg 
1392 

Leu Lys Leu Leu 
450 

att ttc acc aat 
1440 

lie Phe Thr Asn 

465 

gac aca ata ate 
1488 

Asp Thr lie lie 



gag age aaa gat 
1536 

Glu Ser Lys Asp 
500 

gtt etc aag tct 
1584 

Val Leu Lys Ser 
515 

aag gac aga ccc 
1632 

Lys Asp Arg Pro 
530 

eta egg aag ttt 
1680 

Leu Arg Lys Phe 
545 

cac ttc age acg 
1728 

His Phe Ser Thr 



tat aat gat gee 
1776 

Tyr Asn Asp Ala 
580 
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gaa tgt tct cac 

Glu Cys Ser His 
405 

gaa att gtg age 

Glu He Val Ser 

acc agt gag caa 

Thr Ser Glu Gin 
440 

ctg gag tgg aac 

Leu Glu Trp Asn 
455 

gac cgc cga tgg 

Asp Arg Arg Trp 
470 

cag gtc aca tgg 

Gin Val Thr Trp 
485 

gtg act gac ggc 

Val Thr Asp Gly 

get gac ctt caa 

Ala Asp Leu Gin 
520 

aag ttt gtc cgc 

Lys Phe Val Arg 
535 

etc acc cat gat 

Leu Thr His Asp 
550 

ctt gtg tac egg 

Leu Val Tyr Arg 
565 

etc etc acg ttt 
Leu Leu Thr Phe 



eta tta aca gtt 

Leu Leu Thr Val 
410 

aat gee ate tec 

Asn Ala He Ser 
425 

gac aag gat aac 

Asp Lys Asp Asn 

cag ctg gac tta 

Gin Leu Asp Leu 
4 60 

gag aag age aaa 

Glu Lys Ser Lys 
475 

ctg gaa aat ggt 

Leu Glu Asn Gly 
490 

aaa gee tct tct 

Lys Ala Ser Ser 
505 

gaa gtc atg ttt 

Glu Val Met Phe 

etc ttt ctg gag 

Leu Phe Leu Glu 
540 

gtc etc act gaa 

Val Leu Thr Glu 
555 

aat ctg cag ate 

Asn Leu Gin He 
570 

gtc tgg aaa ctg 

Val Trp Lys Leu 
585 
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att aaa atg gaa 

He Lys Met Glu 
415 

tac get eta tac 

Tyr Ala Leu Tyr 
430 

tgg aat ggg cag 

Trp Asn Gly Gin 
445 

gee aat gat gag 

Ala Asn Asp Glu 

ccg agg etc aga 

Pro Arg Leu Arg 
480 

aga ate aag gtt 

Arg He Lys Val 
495 

cat atg ctg gtg 

His. Met Leu Val 
510 

acg get etc ata 

Thr Ala Leu lie 
525 

aat ggc ttg aac 
Asn Gly Leu Asn 

etc ttc tec aac 

Leu Phe Ser Asn 
560 

gec aag aat tec 

Ala Lys Asn Ser 
575 

gtt gcg aac ttc 

Val Ala Asn Phe 
590 
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cga aga ggc ttc egg aag gaa gac aga aat ggc egg gac gag atg gac 
1824 

Arg Arg Gly Phe Arg Lys Glu Asp Arg Asn Gly Arg Asp Glu Met Asp 
595 600 605 

ata gaa etc cac gac gtg tct cct att act egg cac ccc atg caa get 
1872 

lie Glu Leu His Asp Val Ser Pro lie Thr Arg His Pro Leu Gin Ala 
610 615 620 

etc ttc ate tgg gec att ctt cag aat aag aag gaa etc tec aaa gtc 
1920 

Leu Phe lie Trp Ala lie Leu Gin Asn Lys Lys Glu Leu Ser Lys Val 

625 630 635 640 

att tgg gag cag acc agg ggc tgc act ctg gca gee ctg gga gee age 
1968 

lie Trp Glu Gin Thr Arg Gly Cys Thr Leu Ala Ala Leu Gly Ala Ser 
645 650 655 

aag ctt ctg aag act ctg gee aaa gtg aag aac gac ate aat get get 
2016 

Lys Leu Leu Lys Thr Leu Ala Lys Val Lys Asn Asp lie Asn Ala Ala 
660 665 670 

ggg gag tec gag gag ctg get aat gag tac ctg acc egg get gtt ggt 
2064 

Gly Glu Ser Glu Glu Leu Ala Asn Glu Tyr Leu Thr Arg Ala Val Gly 
675 680 685 

gag tec aca gtg tgg aat get gtg gtg ggc gcg gat ctg cca tgt ggc 
2112 

Glu Ser Thr Val Trp Asn Ala Val Val Gly Ala Asp Le.u Pro Cys Gly 
690 695 700 

aca gac att gee age ggc act cat aga cca gat ggt gga gag ctg ttc 
2160 

Thr Asp lie Ala Ser Gly Thr His Arg Pro Asp Gly Gly Glu Leu Phe 
705 710 715 720 

act gag tgt tac age age gat gaa gac ttg gca gaa cag ctg ctg gtc 

2208 " "* ' 

Thr Glu Cys Tyr Ser Ser Asp Glu Asp Leu Ala Glu Gin Leu Leu Val 

725 730 735 

tat tec tgt gaa get tgg ggt gga age aac tgt ctg gag ctg gcg gtg 
2256 

Tyr Ser Cys Glu Ala Trp Gly Gly Ser Asn Cys Leu Glu Leu Ala Val 
740 745 750 

gag gee aca gac cag cat ttc ate gee cag cct ggg gtc cag aat ttt 
2304 

Glu Ala Thr Asp Gin His Phe lie Ala Gin Pro Gly Val Gin Asn Phe 
755 760 765 

ctt tct aag caa tgg tat gga gag att tec cga gac acc aag aac tgg 
2352 

Leu Ser Lys Gin Trp Tyr Gly Glu lie Ser Arg Asp Thr Lys Asn Trp 
770 775 780 
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aag att ate ctg tgt ctg ttt att ata ccc ttg gtg ggc tgt ggc ttt 
2400 

Lys lie lie Leu Cys Leu Phe lie lie Pro Leu Val Gly Cys Gly Phe 

785 790 795 , 800 

gta tea ttt agg aag aaa cct gtc gac aag cac aag aag ctg ctt tgg 
2448 

Val Ser Phe Arg Lys Lys Pro Val Asp Lys His Lys Lys Leu Leu Trp 
805 810 " 815 

tac tat gtg gcg ttc ttc ace tec ccc ttc gtg gtc ttc tec tgg aat 
24 96 

Tyr Tyr Val Ala Phe Phe Thr Ser Pro Phe Val Val Phe Ser Trp Asn 
820 825 830 

gtg gtc ttc tac ate gee ttc etc ctg ctg ttt gee tac gtg ctg etc 
2544 

Val Val Phe Tyr lie Ala Phe Leu Leu Leu Phe Ala Tyr Val Leu Leu 
835 840 845 

atg gat ttc cat teg gtg cca cac ccc ccc gag ctg gtc ctg tac teg 
2592 

Met Asp Phe His Ser Val Pro His Pro Pro Glu Leu Val Leu Tyr Ser 
850 855 860 

ctg gtc ttt gtc etc ttc tgt gat gaa gtg aga cag ggc egg ccg get 
2640 

Leu Val Phe Val Leu Phe Cys Asp Glu Val Arg Gin Gly Arg Pro Ala 

865 870 875 "* 880 ~ 

get ccc agt gcg ggg ccc gee aag ccc acg ccc ace egg aac tec ate 
2688 

Ala Pro Ser Ala Gly Pro Ala Lys Pro Thr Pro Thr Arg Asn Ser lie ' 

885 " 890 895 

tgg ccc gca age tec aca cgc age ccc ggt tec cgc tea cgc cac tec' 
2736 

Trp Pro Ala Ser Ser Thr Arg ,Ser Pro Gly Ser Arg Ser Arg His Ser 
900 905 910 

ttc cac act tec ctg caa get gag ggt gee age tct ggc ctt ggc cag 
2784 

Phe His Thr Ser Leu Gin Ala Glu Gly Ala Ser Ser Gly Leu Gly Gin 

915 920 925 

ccc aga aag ggg etc cca cag tgc age ggt ggg ctg aag ggc tec tea 
2832 

Pro Arg Lys Gly Leu Pro Gin Cys Ser Gly Gly Leu Lys Gly Ser Ser 

930 935 940 

agt gec gee aaa gtg gga gee cag gca gag gag gtg ccg aga gca age 
.2880 

Ser Ala Ala Lys Val Gly Ala Gin Ala Glu Glu Val Pro Arg Ala Ser 

945 950 955 960 

gag ggc tgt gag gac tgc cag cac get gtc acc tct cag aag cgt aag 
2 928 

Glu Gly Cys Glu Asp Cys Gin His Ala Val Thr Ser Gin Lys Arg Lys 

965 970 975 
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aca gca atg gac caa aca gac gaa gat etc ttc ccc tat gga gca ttc 
2976 

Thr Ala Met Asp Gin Thr Asp Glu Asp Leu Phe Pro Tyr Gly Ala Phe 
980 985 990 

tac cag ttc ctg atg att tec agg age ttt cga gga gag gag atg age 
3024 

Tyr Gin Phe Leu Met lie Ser Arg Ser Phe Arg Gly Glu Glu Met Ser 
995 1000 " 1005 

ate ggc aag cag cac 
3039 

lie Gly Lys Gin His 
1010 
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